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ON  SOME  STOCHASTIC  INEQUALITIES 
INVOLVING  MINIMUM  OF  RANDOM 
VARIABLES* 

Peter  Kubat 

The  Graduate  School  of  Management 
The  University  of  Rochester 
Rochester,  New  York 

ABSTRACT 

Let  X/  be  independent  IFR  random  variables  and  let  K,  be  independent  ex¬ 
ponential  random  variables  such  that  E[Xt]  -  £[Y(]  for  all  i  -  1.2 . n. 

Then  it  is  well  known  that  £[  min  (X,)|  >  £1  min  (K,)).  Nevertheless,  for 
U/<«  !«/«» 

exponentially  distributed  K/s  and  for  a  decreasing  convex  function  it  is 
shown  that 


£[<J>(  min  (*,»]<£!•(  min  (K,»l. 
!</<» 


1.  INTRODUCTION 

The  bounds  for  the  mean  lives  of  series  and  parallel  systems  under  various  assumptions 
on  component  life  distribution  were  extensively  treated  in  the  well-known  book  by  Barlow  and 
Proschan  [3]. 

In  particular,  if  we  denote  by  X,{Y,)  the  life  length  of  the  Ah  component  in  the  first 
(second)  series  system,  then  the  following  proposition  holds: 

THEOREM  1:  (Barlow  and  Proschan  [3,  p.  122]).  Let  X,(Yt)  have  continuous  distribu¬ 
tion  F,(G,)  with  the  mean  n,.  Let  X\ . X„  (Ki .  Yn)  be  independent  and  F,  <  Gj, 

I  «■  1,  ....  n,  i.e.,  F,  is  star-shaped  with  respect  to  G,[F,  <  G,  iff  (1  /x)G~'  F,(x)  is  increasing 

for  x  ^  0).  Then  the  mean  life  of  a  series  system  using  components  with  lives  JTj . Xn  is 

greater  than  the  corresponding  system  mean  life  using  components  with  lives  K, .  Y„,  that 

is, 

(1.1)  £[minW,,  X„)]>  £lmin(K . .  Y„)i. 

In  this  note,  it  will  be  shown  that  for  F,  having  Increasing  Failure  Rate  (IFR)  and  G , 
being  exponential  we  get 

(1.2)  £(<fr(min(Jf| . Xn)]  <  £[4>(min(F1 .  Y„) I, 

•This  work  was  supported  in  part  by  the  Office  of  Naval  Research  under  grant  No.  CNA  SUB  N00014-76-C-OOOI. 
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provided  that  <!>(■)  is  decreasing  convex.  A  special  case  of  the  inequality  (1.2)  was  already  dis¬ 
cussed  in  [2]  where  bounds  for  the  discounted  mean  of  the  series  system  life  was  considered. 

2.  BOUNDS  FOR  THE  MEAN  OF  A  FUNCTION  OF  LIFE  OF  THE  SERIES  SYSTEM 

Before  we  prove  the  main  result  the  following  two  lemmata  are  needed. 

LEMMA  1:  Let  Y,  Fh  i  —  1,2,  be  two  positive  continuous  r.v.’s  such  that  F\  <  F2 
and  £[^1  -  E[X2 1.  Then, 

(2.1)  £M>(Y,)1  s;  £[4>(Y2)] 
for  any  convex  and  bounded  function  <!>(•). 

PROOF:  For  i  -  1 , 2  we  have 

£[<!>(*,)]  -  fQ°°  <t>(x)dFi(x) 

-  -4> (*)/;(*) [“  +  /o“  0(x)  Fix)  dx 

where  we  denoted  0(x)  -  4>'(x)  and  Fix)  -  1  -  Fix).  Since  <&(■)  is  bounded  then 

-4>(x)F(x)  I  -  c  is  finite.  Moreover,  since  $(•)  is  convex  then  0(x)  is  increasing  and  thus 
all  the  conditions  of  Lemma  6.4  [3,  p.  112]  are  satisfied  and 

fQ  0(x)£,(x)  d!x  <  JQ  fliix)F2ix)  dx. 

The  rest  of  the  proof  follows  easily.  □ 

LEMMA  2:  Let  Z  -  Fzix)  -  1  -  e~Kx  and  X  ~  £,(x)  -  1  -  e~»x  with  £[Z]  >  E[X\. 

Then  there  exists  a  unique  point,  say  x0,  where  the  density  of  X  crosses  the  density  of  Z  and 

the  crossing  is  from  above. 

PROOF:  £[Z)  >  £[Y]  implies  n  >  A.  Solving  lie  *x°  yields  x0- 

ifi  —  A)-1  ln(/x/\)  >0.  □ 

Now  we  are  ready  to  prove  the  main  theorem. 

THEOREM  2:  Let  X,  ~  £,  IFR  and  Y,  —  G,  with  G(  being  exponential  and  £(Y,1  - 
E[Y,\  for  i  -  1,  . . .  ,  n.  Let  <!>(•)  be  decreasing  convex  bounded  function  on  [0,°o).  Then 

(2.2)  £[4>(min  Y,)]  <  £[4>(min  Y,)]. 

I  i 

PROOF:  Denote  by  X  -  min(^,).  Let  X  —  F,  then  F  is  IFR  and  there  exists  an 

exponential  r.v.  Z  with  cumulative  distribution  function  (c.d.f.)  G  such  that  £[Y]  —  £(Z]. 
Furthermore,  F  <  G  [2,  p.  107], 

By  the  Lemma  1  we  have  £[4(JT)]  <  £(4>(Z)],  and  by  Theorem  1  we  have 

£tZ]  >  £[K],  where  F-min(K1 .  K„).  Note  that  Y  is  also  exponential.  If 

£lZ]  -  E[Y)  we  are  done. 
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If  E\Z\  >  £iy],  then  there  exists  a  unique  point  x0  where  the  density  of  Y  crosses  the 
density  of  Z  and  the  crossing  is  from  above.  Since 

£(<i>(Z)]«  f°°  <J>(x)\<>-AJr  dx 
*'0 

and 

£l4>m]  -  f°°  d>(x WMXdx 

•'O 

we  get 

(2.3)  £[4>(Z)1  -£[«!><  K)]  -  /0°°  d»(jr)  (\e~kx  -  ne^)  dx 
which  can  be  written  as 

(2.4)  J  [<&(x)  —  <J>(x0)]  (\e~kx  —  lie'11*)  dx  +  <J>(x0)  J*Q  (Ac-**  —  /ue_MJt)  dx. 

The  second  term  of  (2.4)  is,  of  course,  equal  to  zero  and  the  first  term  written  as, 

0  —  ld>(x)  —  <J>(x0)]  (Ac-**  —  ne'**)  dx 
•f  f "  l<t>(x)  ~  <P(x„)l  (ke~Kx-  fie'MX)  dx  <  0. 

JUg 

is  nonpositive,  since  for  x  €  [O.xol,  <t>(x)  -  <f>(x0)  ^  0  (<!>(*)  is  decreasing)  and 
\e~Kx  -  fie~,xx  <  0  (by  Lemma  2).  Similarly,  for  x€  (x0,«>),  4>(x)  -  <J>(x0)  <  0  and 
\e~Kx  -  !xe^x  >  0.  D 


When  the  function  <!»(-)  is  exponential  we  obtain  the  following  corollary: 

COROLLARY  I:  Let  X,  —  F(  IFR  and  Y(  —  G,  exponential,  for  i  -  1 . n, 

£[X,1  -  £[  K,]  -  I /fir  Then  for  4>(x)  -  e~ax  we  get 

2>, 

£[exp{-a(min  X,)\\  <  — — . 


Finally,  let  us  remark  that  Theorem  2  will  hold  even  when  F,  is  Increasing  Failure  Rate 
Average  (IFRA),  since  this  condition  together  with  G,  being  exponential  still  implies  that 
£  <  G  13,  p.  1071. 

3.  EXAMPLES 

Two  simple  examples  will  illustrate  the  theorem. 

EXAMPLE  1:  Consider  an  n-unit  series  system,  where  the  unit  lifetime  X,  —  £,.  The  £, 
is  IFR  and  E[X,\  -  1/m,.  When  the  system  fails  it  is  replaced  by  a  new  system.  The  cost  of 
the  new  system  at  the  time  /  -  0  is  C.  The  expected  discounted  cost  of  replacement  at  the 

time  of  system  failure  is  CE[e~aX 1,  where  X  —  min(Y| . X„)  denotes  the  life  time  of  the 

system. 
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It  follows  from  Corollary  1  that  the  upper  bound  for  the  discounted  replacement  cost  is 

c  — - - . 

«  + 

/ 


Suppose  now  that  F ]  is  Gamma  (X,.a,),  a,  >  1,  for  all  / 
distribution  function  (p.d.f.)  of  X,  is 


fSx) 


kA/x)"'  -Kx 

-  p  l 

r  (a,) 


x  >  0,  X,  >  0. 


1 . rr,  i.e.,  the  probaoility 


In  this  case  Fs  is  IFR,  E[X,\  —  ajk,  and  thus  /tty  -  X,/a(.  Moreover, 
C£le-“*)  <  C/WA  +  1), 
where  A  -  £  (Xf/a,-). 


EXAMPLE  2:  Let  X  —  min(X| . X„).  All  X,' s  are  independent,  have  IFR  distribu¬ 

tion  F,  and  FIX’,]  =  l//x,.  Assume  that  4>(x)  —  (ax  +  /3)_l,  a  >  0,  /3  >  0.  Clearly,  <t>(x)  is 
decreasing  convex  and  bounded  on  (0,<»).  It  follows  from  Theorem  2  that  in  this  case 


flow)] 


FKaX-b/S)-'!  <  f“ 
J  0 


1 

ax  +0 


\e~Kx  dx 


e“  E{(d). 


where  A  —  —  A/3/a  and  £t(rf)  denotes  the  Exponential  Integral.  The  numerical 

/ 

values  of  £,(</)  are  tabulated  in  [1]. 
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ABSTRACT 

An  availability  measure  is  the  probability  that  a  two-slate  system  modeled 
by  an  alternating  renewal  process  is  available  at  one  or  more  points  or  intervals. 
The  concept  of  availability  measures  is  extended  to  formulae  for  the  joint  pred¬ 
iction  of  availability  and  numbers  of  breakdowns  (or  repairs)  of  the  system 
during  a  fixed  interval. 


1.  INTRODUCTION 

Consider  a  two-state  system,  i.e.,  a  machine  subject  to  stochastic  failure  and  repair,  and 
suppose  that  the  breakdown/reactivation  cycle  of  the  machine  can  be  modeled  by  means  of  an 
alternating  renewal  process  (e.g.,  Cox  l4l,  Chapter  7).  Under  this  assumption,  we  can  derive  a 
variety  of  useful  formulae  for  predicting  the  reliability  of  the  system.  In  particular,  we  can 
determine  the  distributions  of  the  Sit),  the  numbers  of  failures  and  repairs  in  a  fixed  interval 
[1],  and  we  can  obtain  availability  measures,  probabilities  of  the  form  pi/it)  -  1  V  t  €  T) 
where  lit)  -  1(0)  if  the  system  is  operating  (failed)  at  t  and  where  Tis  an  index  set  compris¬ 
ing  a  (finite)  series  of  points  or  intervals  12],  The  dependence  between  the  Sit)  and  the  /(/) 
precludes  the  simultaneous  prediction  of  numbers  of  failures  (or  repairs)  and  availability  using 
existing  theory.  In  this  paper,  we  present  a  series  of  formulae  which  generalize  the  availability 
measures,  enabling  us  jointly  to  predict  numbers  of  breakdowns  (or  repairs)  and  availability. 
We  call  these  formulae  compound  availability  measures. 

Availability  measures  are  of  use  in  assessing  the  likelihood  that  a  repairable  machine 
modeled  by  an  alternating  renewal  process  is  available  at  specified  times.  If  the  probability  falls 
below  a  certain  threshold,  arrangements  for  securing  an  adequate  back-up  can  be  made.  The 
distributions  of  the  Sit),  on  the  other  hand,  are  of  use  in  deciding  on  the  allocation  of 
sufficient  repair  facilities  for  a  fixed  time  interval.  The  formulae  presented  in  this  paper  enable 
us  to  consider  these  probabilities  simultaneously. 

In  Section  2  we  examine  the  simplest  case  and  consider  probabilities  of  the  form  p{l(t)  — 
1,  Sit)  -  j }  and  p{Hu)  -  1  V  u  €  [/,/  +  x],  Sit)  -  j }.  In  Section  3  we  consider  probabili¬ 
ties  of  the  form  p[lit  +  x)  —  1,  Sit)  -  j\,  and  in  Section  4  we  consider  probabilities  of  the 
form  p{Ht)  -  1,  Nit  +  x)  -  Nit)  -  j).  Some  numerical  examples  for  the  Weibull/gamma 
process  are  given  in  Section  3.  It  is  first  necessary  to  introduce  some  notation  and  briefly  to 
review  some  relevant  results. 
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Let  F  and  G  denote  the  distribution  functions  of  the  failure  times  and  repair  times, 
respectively,  and  let  their  Stieltjes  convolution  be  denoted 

Kit)-  F  *  Git)-  ('  Fit-  u)  dG(u). 

Jo 

Further,  the  fl-fold  recursive  Stieltjes  convolution  of  K  is  written  K{n)  for  n  —  1,2,3,  ....  and 
we  define  A<ol(r)  —  HO)  if  t  >  (<)  0.  Let  /,  #and  k  be  the  densities  corresponding  to  G 
and  K ,  respectively.  The  renewal  counting  functions  of  the  numbers  of  failures  and  repairs  in 
(0,rl  are  denoted  N\it)  and  N2il)  (jV2(r)  and  /V3(r)),  respectively,  assuming  that  there  is  a 
repair  (failure)  at  time  0.  The  corresponding  renewal  functions  and  renewal  densities  are 
//,(/)  -  E(/V,<f))  and  h,0)  -  dH,it)ldtii-  1,2,3). 

The  distributions  of  the  N,it)  are  as  follows: 


(1.1) 

(1.2) 

(1.3) 


p{Ntit) 

p{N2it) 

p{Niit) 


|  1  -  Fit) 

"'“If*  *<"-|,(r)  -  f  *  Ktn'(t) 

n\  -  K'n,it)  -  Kin+]iit) 
j  1  —  Git) 

n]  “  1  G  •  K("-Uit)  -  G  •  K{"'it) 


n-0 
n  ^  1 

n  >  0 
n- 0 
n  >  1 


(see  Barlow  and  Hunter  [1  ]>.  The  point  availability  of  the  two-state  system  is  defined  as 
Akit)  —  p{lkit)  —  1}  where  k  —  0(1)  if  there  is  a  failure  (repair)  at  time  0.  The  interval  avai¬ 
lability  is  defined  as  Rkix,t)  —  p[lkiu)  —  1  Vw  €  [t.t  +  *]).  It  can  be  shown  that 


(1.4)  Ax(t)  -  Fit)  +  F  *  H2it) 

(1.5)  A0it)  -  Git)  -  G  •  H2it) 

(1.6)  /?,U,r)  -  Fit  +  x)  +  Jo  h2iu)  Fit  +  x  -  u)  du 

(1.7)  R0(x,t)  -  fQ  h2iu)Fit  +  x  -  u)  du 
where 

Fit)  -  1  -  F(r)[2l. 


2.  COMPOUND  MEASURES  OF  lit)  AND  Nit) 

The  expressions  for  the  simplest  compound  availability  measures,  i.e.,  p{lit)  —  1, 
Nit)  —  y'},  are  of  particular  interest  as  surprisingly  simple  formulae  can  be  obtained  for  the 
covariances  of  the  /*(/)  and  the  Ntit). 


(2.1) 

p{/,(r)-  1.  7V,(r)  -y)  - 

F  •  K^'it) 

j>Q 

(2.2) 

pUiit)  -  I,  N2(t)  -j)  - 

F  *  K{J)it) 

j>0 

(2.3) 

p{l0it)~  1.  \2it)-j)- 

F  •  G  •  K{j)it) 

j>0 

(2.4) 

p{l0it)~  1,  A3(f)-y}« 

1- 

\  F  *  G  •  K'J-'>it) 

y-o 

J  >  i. 
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Observe  that  />{ /j (/)  —  1,  N\(t)  =  >1  -  p{l\(t)  —  1,  N2it)  •  j)  for  all  j.  This  is  to  be 
expected  in  view  of  the  identity  /,(r)  =  /V2(r)  -  /V,(r)  +  1. 

The  corresponding  formulae  for  P'kix,t,j)  =  p{Ik(u)  —  1  V  w  €  [r,r  +  xl,  N,(r)  =  j]  are 


as  follows: 

(2.5) 

P\  ( x.t,j )  *  i 

1 

Fit  +  x ) 

f  k<J,iu)  Fit  +  x  -  u)  du 
Jo 

j- o 

j  >  1 

(2.6) 

P \  ( x.t.j )  - 

Fit  +  x) 

f  kij)(u )  Fit  +  x  -  u)  du 

Jo 

J-  o 

j  >  1 

(2.7) 

Po  (x.t.j)  -  _ 

f  bjiu)Fit  +  x  —  u)  du 

j  >  o 

(2.8) 

Pq  (x.t.j)  - 

l», 

[  J"fl  bj-.\(u)  Fit  +  x  ~  u)  du 

J-  0 

j  >  1 

where  k(j)it ) 

=  dKij)it)/dt  and 

I  i  sit  -  u)  k[J'iu)  du  j  >  l 
\t)  j-  o. 


It  should  be  noted  that  the  above  formulae,  and  many  of  those  of  Sections  3  and  4,  do 
not  need  to  treat  the  cr  j  =  0  separately,  but  this  has  been  done  in  the  interests  of  clarity. 

The  following  covariances  are  readily  derived  from  (2.1 )- (2.4)  above. 

(2.9)  cov  (/,(/).2V,(/)}  -  A,  •  H2(t)  =  AXU)HXU) 

(2.10)  cov  {/,(/). N2it))  =  A  |  •  H2it)  -  Axit)H2it) 

(2.1 1)  cov  {/0(/).2V2(/)J  -  A0  *  H2it)  -  A0it)H2(t) 

(2.12)  cov  |/0(/),Af3(/)}  -  /I0  *  W2(f)  -  40(t)  lHj(r)  -  lj. 


These  may  be  obtained  directly,  in  which  case  the  identity 
£yA”'’(r)  -  H2  •  H2it)  +  H2(t) 

i-\ 

will  be  found  helpful,  or  by  exploiting  the  binary  nature  of  the  indicator  variable.  Thus,  for 
example,  expression  (2.10)  follows  from  the  identity 

‘  EiNtU)UlU)  -  "  -  eW'm,)  -  01 
and  the  other  covariances  may  be  derived  similarly. 
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Observe  that  the  expectation  of  the  product  of  the  indicator  variable  and  N2(t)  is  the 
Stieltjes  convolution  of  their  expectations. 

3.  COMPOUND  MEASURES  OF  Hi  +  x)  AND  Nit) 


We  now  present  formulae  for  probabilities  of  the  form  p{lk(t  +  x)  ®  1,  A',!/)  =  j}. 
(3.1)  p{/iit  +  x>-  1.  Nlit)=j) 


where 


=  Fit  +  x)  +  J*Q  fit  +  y)  A0(x  -  y )  dy 
=  f  k{J)  iu)  Fit  +  x  —  u)  du 

J  0 

+  J*Q  J*Q  k<j)  iu)  fit  +  y  -  u)  A0ix  ~  y)  dudy 
+  J*0  J*Q  j ( m )  git  +  y  -  u)  A\ix  -  y)  dudy 

a(t)=\Slf{,-u)k{j](u)du 

'  \  fit) 


j  —  0 


j  >  1 


J-  0 


(3.2)  p\lxit  +  x)  -  1,  N2(t)  =j\ 

=  F(r  +  x)  +  J*Q  /(/  +  j')  ^o(*  ~  J')  4v 

+  fX  ('f(u)g  it  +  y  -  u)A ,  (x  -  y)  dudy  j  =  0 

•/  0  0 

«  f  k(j}iu)Fit  +  x  -  u)  du 

Jo 

+  f*  k(j)iu)  fit  +  y  ~  u)  A0(x-y)  dudy 
+  Jg  Jo  ajiu)git  +  y  -  u)  A tix  -  y)  dudy  j  >  1 

(3.3)  p[l0(t  +  x)  =  1,  N2it)  -j) 

=  JQ  bjiu)  Fit  +  x  -  u)  dudy 

+  JQ  JQ  bjiu) fit  +  y  -  u)  A0ix  -  ^)  dudy 

+  f  k{J)iu)  git  +  y  -  u)  A\ix  -  y)  dudy  j  ^  0 

(3.4)  p{I0(t  +  x)  =  \,  N}(t)  -  j] 

=  /0  g(r  +  y)  /<i(x  -  y)  dy  y-0 

-  J"  bj_\(u)Fit  +  x~u)du 
+  J0  /0  bj-\iu)  fit  +  y  -  u)  A0ix  -  y)  dudy 

+  f  f'  k(J)iu)  git  +  y  -  u)  Atix  -  y)  dudy  j>  1. 

•f  0  0 

Note  that,  as  would  be  expected,  expressions  (3. 1 )- (3.4)  degenerate  to  (2.1M2.4), 
respectively,  at  x  -  0  and  that  application  of  the  key  renewal  theorem  shows  that 

lim  p[lk(t  +  x)  —  1,  N/it)  —  j\  ~  — —■ —  p{Niit)  —  j) 

x—oo  Ml  M2 

where  mi  and  p2  are  the  means  of  Fand  G ,  respectively. 
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The  corresponding  formulae  for  p[tkiu)  «  1  V  u  €  [/_  +  x,t  +  x  +  w],  N,it)  =  j]  are 
obtained  from  (3.l)-(3.4)  above  on  replacing  Fit  +  x)  by  Fit  +  x  +  w)  and  Akix  —  y)  by 
Rk  (  h',x  -  .y)  as  appropriate.  Further  extensions  of  these  formulae  to  more  general  compound 
availability  measures  of  the  form 

/>!/*(/,)  -  lk(t2)  =  •  •  ■  =  /*(/„)  =  1,  J V,(r)  =  j)  for  t  <  r,  <  t2  <  ■  ■  ■  <  t„ 
are  similarly  obtained. 

4.  COMPOUND  MEASURES  OF  /(<)  AND  Nit.t  +  x) 

In  this  section  we  consider  a  different  type  of  compound  availability  measure,  that  of  the 
form  p[/k(t)  =■  1,  Nj(t,t  +  x)  =  j)  where  N-it.t  +  x)  *  Nft  +  x)  -  A,(r)  is  the  number  of 
renewals  in  [/,/  +  x]. 


(4.1) 

p{l\(t)  =  1,  N{it,t  +  x)  =  j\ 

=  Rtix,t) 

=  fo  fit  +  u) p{N2ix  -  u)  -  j  -  1}  du 

7  =  0 

+  Jo  X  ^2(3)  fit  ~  s  +  u)  p{N2ix  —  u)  =  j  —  1}  dsdu 

J  >  1 

(4.2) 

p[l\it )  =  1.  N2it,t  +  x)  =  j) 

=  R\(x,t)  +  Jq  fit  +  u)  Gix  -  u)  du 

+  X  J*  h2is)  fit  -  s  +  u)  Gix  —  u)  dsdu 
=  X  fi'  +  u)p\N2ix  -  m)  -  ./')  du 

7-0 

+  f  f  h2(s)  fit  -  s  +  u)  p{S2ix  -  u)  «  j)  dsdu 

J  >  1 

(4.3) 

p(/0(r)  *  1,  N2it,t  +  x)  —  /) 

-  R0ix,t) 

7-0 

—  X  X  —  s  +  u)  p{N2ix  -  u)  —  j  —  1)}  dsdu 

J  >  1 

(4.4) 

p{/0it)  -  1,  Aj(r,r  +  x)  -  j) 

-  /?0(x,r)  +  J"0  fg  h}is)  fit  -  s  +  u)  Gix  -  w)  rferfu 

7  =  0 

—  X  X  hiisi  fit  ~  s  +  u)  p{N2ix  -  u)  -  j)  dsdu 

J  >  1. 

Observe  that,  as  would  be  expected,  (4.1)  and  (4.2)  degenerate  to  (1.1)  and  (1.2),  respec¬ 
tively,  at  /  -  0  whereas  both  (4.3)  and  (4.4)  vanish.  Note  further  that  on  applying  the  key 
renewal  theorem  to  the  above  formulae  we  see  that 

lim  p{l\it)  -  l,  /V|(r,r  +  x)  -  j]  -  lim  p{l0it)  —  l,  N2it,t  +  x)  -  j) 

l—oo  (—*  OO 

A  ¥(x)  7-0 

A  JQ  *!i(u)  p\N2(x  -  u)  -  j  -  \)du  j  ^  1 


VOL.  29.  NO.  3,  SEPTEMBER  1982 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


408 


1  A  BAX I I R 


and  that 

lim  p{/|(t)  -  1,  N2(i,i  +  x)  -  j )  -  lim  p[l0it)  =  1,  +  x)  -  j) 

(—*<30  (—<30 

=■>  A  f  0(t/)  /»{ A^3(jf  —  m)  =  7)  j  >  0 
•/  0 

where  /l  -  am/(mi  +  P2>-  <Mf)  =  F(t)/fi{  and  ¥(/)  =  J*Q  </i(u)  du.  Thus,  we  see  that  the 
asymptotic  probabilities  are  equal  to  the  corresponding  probabilities  for  the  equilibrium  alternat¬ 
ing  renewal  process. 

Generalizations  to  formulae  of  the  form  p{lk(u)  -  1  V  u  €  [t,t  +  jt],  N,(t  +  w,t  + 
w  +  x)  -  y)  are  readily  obtained  by  obvious  modifications  to  (4.1M4.4). 

5.  APPLICATIONS 

One  important  industrial  application  of  the  alternating  renewal  process  is  to  model  the 
sequences  of  failures  and  repairs  of  electricity  generating  plant,  such  as  boilers  and  turbo¬ 
generators.  The  author’s  experience  in  analyzing  such  data  shows  that,  typically,  failure  times 
are  identically  distributed,  as  are  repair  times,  and  that  these  random  variables  do  not  appear  to 
be  dependent.  Knowledge  of  various  availability  measures  is  a  means  of  assessing  the  likeli¬ 
hood  that,  for  example,  peak  demand  will  be  met  over  one  or  more  periods.  The  distributions 
of  the  renewal  counting  functions,  on  the  other  hand,  give  some  indication  of  the  number  of 
breakdowns  likely  in  a  fixed  interval.  These  may  be  used,  for  example,  in  deciding  the  levels 
of  stocks  of  those  spare  parts  which  are  most  commonly  used  in  repair  work.  The  compound 
availability  measures  enable  us  to  make  the  two  sets  of  predictions  simultaneously. 

As  an  example,  we  examine  the  alternating  renewal  process  with  Weibull  failure  times 
and  gamma  repair  times,  the  scale  parameter  being  unity  in  each  case,  i.e., 

/(f)  =  ctf"-1  e~'°,  git)  =  tv~l  e_'/r(r)). 


Values  of 

/>{/,(/)  -  1.  Nlit)  -  j)  -  p[l\it)  -  1,  iV2(f)  -;)-/•  KiJ)it) 

were  evaluated  for  0  <  t  ^  10  and  j  —  1,2,3,  ...  for  a  variety  of  values  of  a  and  r>.  The 
numerical  integration  was  achieved  by  means  of  the  Cleroux-McConalogue  algorithm  [3]  for 
recursively-defined  Stieltjes  convolutions  using  FORTRAN  subroutines  developed  by 
McConalogue  [5],  (See  McConalogue  (6]  for  further  details.)  The  values  of  F  *  K{J)(t) 
obtained  are  illustrated  in  Figures  1-4  for  the  following  combinations  of  shape  parameters: 


Figure 

a 

V 

1 

l 

1 

2 

2.5 

2.5 

3 

4.5 

2.5 

4 

5 

1.25. 

ACKNOWLEDGMENT 

I  should  like  to  thank  the  referee  for  a  number  of  suggestions  which  improved  the  presen¬ 
tation. 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


VOL.  29,  NO.  3,  SEPTEMBER  1982 


410 


L  A  BAXTER 


REFERENCES 

[1]  Barlow,  R.E.  and  L.C.  Hunter,  "Reliability  Analysis  of  a  One-Unit  System,"  Operations 
Research  9,  200-208  (1961). 

[2]  Baxter,  L.A.,  "Availability  Measures  for  a  Two-State  System,"  Journal  of  Applied  Probabil¬ 
ity  18,  227-235  (1981). 

[3]  Cleroux,  R.  and  D.J.  McConalogue,  "A  Numerical  Algorithm  for  Recursively-Defined 
Convolution  Integrals  Involving  Distribution  Functions,"  Management  Science,  22,  1138- 
1146  (1976). 

[4]  Cox,  D.R.,  "Renewal  Theory"  (Methuen,  London,  1962). 

[5]  McConalogue,  D.J.,  "Convolution  Integrals  Involving  Probability  Distribution  Functions” 
(Algorithm  102),  Computer  Journal,  21,  270-272  (1978). 

[61  McConalogue,  D.J.,  "Numerical  Treatment  of  Convolution  Integrals  Involving  Distribu¬ 
tions  with  Densities  Having  Singularities  at  the  Origin,"  Communications  in  Statistics,  B 
10,  265-280  (1981). 


naval  research  logistics  quarterly 


VOL.  29,  NO.  3,  SEPTEMBER  1982 


A  COMPARISON  OF  SEVERAL  ESTIMATES  OF  AVAILABILITY 


\ 

It, 


!l 


/ 


M.  Mazumdar 

University  of  Pittsburgh 
Pittsburgh,  Pennsylvania 


ABSTRACT 

We  compare  several  compering  estimaies  of  Ihe  availability  of  a  system 
which  alternates  between  two  states,  "up*  and  'down.*  in  accordance  with  an  al¬ 
ternating  renewal  process.  Both  interval  and  point  estimators  are  compared 
under  several  special  but  representative  situations  The  comparison  reaffirms 
the  validity  and  robustness  of  the  log-logistic  iackknifed  estimates  However, 
when  the  point  estimates  are  compared  from  the  intrinsic  criterion  of  probabili¬ 
ty  of  concentration,  the  uniformly  minimum  variance  estimate  obtained  for  the 
Markov  model  performs  very  well 


I.  INTRODUCTION 

In  a  system  which  alternates  between  two  stales,  "up*  or  "down."  in  accordance  wiih  an 
alternating  renewal  process,  the  long-term  point  availability  (often  simply  abbreviated  as  availa¬ 
bility)  measures  the  probability  that  the  process  is  up  at  a  given  instant  of  time  far  enough  in 
the  future.  This  index  measures  the  long-run  expected  fraction  of  time  that  the  system 
operates  satisfactorily.  It  has  been  shown  in  reliability  textbooks,  (see,  for  example.  Barlow  and 
Proschan  [ID,  that  under  not  too  restrictive  conditions,  this  index  equals  the  ratio  of  the  mean 
uptime  to  the  sum  of  the  mean  uptime  and  mean  downtime  Such  two-state  models  are  con¬ 
sidered  frequently  in  the  literature  in  problems  related  to  design  and  maintenance  of  computer 
systems,  communication  systems,  power  plants,  etc.  System  productivity  is  directly  related  to 
equipment  availability,  and,  therefore,  much  attention  is  currently  being  placed  in  the  industry 
on  the  assessment  and  optimization  of  plant  availability.  Any  availability  improvement  program 
has  to  begin  with  the  estimation  of  current  availability.  In  this  paper,  we  examine  several  con¬ 
tending  estimators  of  availability,  and  study  some  of  their  properties. 

In  a  recent  paper,  Gaver  and  Chu  |2l  have  studied  the  behavior  of  the  jackknife  method 
for  estimating  availability  under  several  different  probability  models  for  the  underlying  distribu¬ 
tions.  They  have  pointed  out  that  frequently  in  availability  studies,  it  will  be  difficult  to  obtain 
clear  and  unequivocal  probability  specifications.  Their  results  obtained  by  Monte  Carlo  simula¬ 
tion  show  that  the  jackknife  method  is  reasonably  robust  for  providing  interval  estimaies  and 
valid  under  several  different  special  but  representative  probability  models.  In  this  paper,  we 
carry  out  an  investigation  similar  in  spirit  to  Gaver  and  Chu  12).  Our  point  of  departure  from 
this  paper  is  that  we  consider  an  additional  estimate  of  availability  (Mazumdar  (3|),  and  we  also 
consider  the  intrinsic  properties  of  the  estimaies  from  the  point  of  view  of  point  estimation. 
Our  main  result  can  be  stated  as  follows:  from  the  point  of  view  of  providing  interval  esti¬ 
mates,  the  jackknife  estimate  of  Gaver  and  Chu  appears  to  be  suitably  valid  and  robust,  and  it 
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performs  as  well  or  better  than  the  other  contending  estimates.  However,  from  the  point  of 
view  of  point  estimation,  when  the  intrinsic  property  of  probability  of  concentration  (Rao  IS)) 
is  considered,  the  uniformly  minimum  variance  unbiased  (UMVU)  estimator  seems  to  perform 
as  well  or  better. 

2.  THE  MODEL  AND  THE  ESTIMATES  OF  AVAILABILITY 


We  assume  that  the  times  spent  by  the  process  in  the  "up"  and  "down"  state  are  mutually 
independent,  and  we  have  at  hand  a  sequence  of  In  observations  giving  the  successive  up  and 
downtimes,  U\,  D\,  (J 2,02,  ...  .  U„,  D„  of  the  system-  Except  for  one  case,  we  shall  make 
the  usual  assumption  that  the  uptimes  (/,  have  the  exponential  distribution.  The  downtimes  D( 
are  identically  distributed,  and  in  Section  3,  we  shall  assume  that  their  common  distribution 
belongs  to  the  exponential,  gamma,  lognormal  or  the  Weibull  family.  The  parameter  of 
interest,  availability,  is  given  by 


(1) 


ElU) 

£li/]  +  E[D\ 


where  E[V)  and  £)Z)]  are,  respectively,  the  expectations  of  U ,  and  Dt. 


We  consider  the  following  estimators  of  availability: 


(2) 


(a)  the  maximum  likelihood  estimate: 

3  „  V 

m,e  V  +  D 


where  U  -  £  UJn  and  Z)  «  £  DJ n. 
/-I  ;-l 


(b)  the  jackknifed  maximum  likelihood  estimate: 


(3) 

:  ,  v 

,km,e  V  +  £ 

where 

(3a) 

y-Z  yjn,  e -  z 

i-l 

(3b) 

K  -  nZ7  -  <«  -  1  )U-it 

(3c) 

(3d) 

Et  —  nD  -  (n  -  1  )Z>_(, 

and 

(3e) 

(4) 


(c)  the  log-logistic  jackknifed  estimate  (Gaver  and  Chu  (2)) 


1  +  e 2 
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where 

(4a) 

Z  = 

1  n 

7  Iz' 

n  >=1 

(4b) 

Z>- 

nZ  -  {n  -  1)  Z., 

(4c) 

Z-, 

- ,n  \i  ui  - in  i,?.  °i 

(4d) 

z  = 

In  {U)  -  In  (D). 

(d)  the  uniformly  minimum  variance  unbiased  estimate  (Mazumdar,  (31):  This  estimator 
was  derived  for  the  case  when  both  U,  and  Dt  are  exponentially  distributed  with  un¬ 
known  parameters  k  and  fi,  respectively.  Denote  UlD  by  S.  Then  the  estimator  is 
given  by 


(5a)  A  umvu  (** ) 


1  -  (n  -  1)  £ 

/-0 

/i  —  2 

1  -  (n-l)  I 

/- o 


«  -  1  s'n  -  5)"-'-' 

J  J  n-l+j 
n~2\ 

J  \  n  +  j 


if  S  <  1 


ifS  >  1. 


The  above  reduces  to: 

ln-1 


An) 


1 


S  - 


n-l 

2 

.  c2  J.  . 

n-l 

3 

n  +  1 

O  T 

n  +  2 

2 

3 

S}  ~  ...  +  (-1)" 


n-l 

n-l 


\ 


In -2 
n-l 

if  S'  <  1 


and 


(5b) 


-4u  mvu(^t)  ( 


n  —  1 
1 


s-'  + 


n-l 

2 


S'2  -  ...  +  (-1) 


n- 1 


n-  1 
n  —  1 


r»-  < n  - 1 ) 


n  +  l 
2 


2n  —  2 
n-l 


if  S  >  1. 


(5c) 


Some  examples  of  Aumvv  are  given  below.  When  n  -  2, 

1/2S  if  S  <  1 
1  -  1/2S"'  if  S  >  r 


^umvu^) 


When  n  -  4, 


(5d) 


^umvu(^ 


S  -  ~r  S1  +  rzr  S3 


10 


20 


if  S  S?  1 


l-?s',  +  ¥s_3-is_3  ifS>1 
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(6a) 


(3)  the  jackknifed  uniformly  variance  unbiased  estimate. 
1 


-Vumv^n)  “  ~  Z  Wi(n) 


where 

(6b)  H/,(//)  -  «/4ufnvu(«)  -  (n  -  l)/4_umvu/(n  -  1) 

*4  umvu(  n )  is  given  by  (5b), 


An  -  1) 


n-1 

1 


n-  1 
1 


S-i  - 


n  —  2 
2 


Si,  + 


n  —  2 
3 


«  +  l 
3 


. 


+  (-1)"- 


^-umvu.i^  1 )  1 


I"-2 

i 


n-2 
n  —  2 

2n  -  4 
n-2 

Iff  —  2 


?n-2 


if  S-,  1, 


n-2 


»  1 

-  s:  +  *-7—-  si,  -  ...  +  (-i)"-z  - 

*  r 

n  —  1 

M 

2n  +  4 

1 

W 

n-2 

s: 


(6c) 

and 

(6d) 


if  S_,  >  1 


U_, 

-  ~ 


In  the  terminology  related  to  jackknife  estimates,  the  quantities  £,,  Z,,  If,  are  referred 
to  as  pseudovalues.  Let  0  be  an  estimator  of  the  parameter  0  based  on  a  sample  of  size  n.  Let 
0,  refer  to  the  set  of  pseudovalues  in  this  situation,  -  1,2,  ....  n  and  let 


(7) 


-  i  »  - 


/-I 


It  has  been  stated  (Miller  [4]),  that  under  certain  general  conditions,  the  statistic 

(8)  - 


— t—  £  <#,  -  «)• 
»-*  £7  j 


T7T 


has  an  approximate  t  distribution  with  (/»  -  1)  degrees  of  freedom.  Thus,  the  approximate 
two-sided  100  (1  -  a)%  confidence  interval  for  0  will  be  given  by  (La,  Ua)  where 

1 1/2 

(9)  u  -a  a-,  J — ! —  V  (a  _  a»2l  / »i/* 


1  »  -  |1/2  / 
-f  ^rr ,5 - 9,1 
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L 


a 


£  («,  -  #)2 

/-i  I 


„,/2 


where  „  refers  to  the  Jl  -  y 
dom. 


100%  percentile  of  Student's  /  with  n  -  1  degrees  of  free- 


In  a  recent  paper,  Rao  [51  has  commented  on  the  wisdom  of  using  minimum  mean  square 
error  as  the  only  criterion  for  obtaining  optimum  point  estimates.  He  has  suggested  that 
because  of  mathematical  convenience  and  intuitive  considerations,  the  minimum  mean  squared 
error  could  be  used  as  a  postulate  to  derive  estimators,  but  he  has  recommended  that  their 
acceptability  should  be  judged  on  more  intrinsic  criteria  such  as  the  probability  of  concentration 
(PC),  where  this  quantity  measures 

(10)  PC  =  Pr  [lEstimate  —  True  Value!  <  a} 
for  different  values  of  a. 

For  the  purpose  of  comparing  the  performance  characteristics  of  the  estimates  (a)-(e),  we 
choose  the  following  three  criteria:  (i)  estimate  probability  of  coverage  of  the  true  availability 
using  the  formula  (9)  corresponding  to  a  nominal  value  a;  ii)  the  estimated  mean  and  variance 
of  the  confidence  interval  length;  and  iii)  the  estimated  probability  of  concentration.  (The 
mean  squared  error  was  used  in  [3]  to  compare  the  maximum  likelihood  and  the  umvu  esti¬ 
mate,  and  we  do  not  repeat  the  use  of  this  criterion  here  )  These  estimates  are  obtained  by 
performing  Monte  Carlo  simulation  experiments  where  1,000  synthetic  system  realizations  were 
observed  through  a  cycle  of  n  consecutive  up  and  downtimes. 

Following  Gaver  and  Chu  [2],  we  assume  the  following  distribution  forms  for  the  up  and 
down  times  for  the  purpose  of  the  simulation  study.  Although  these  distribution  forms 
represent  special  cases,  they  are  representative  of  many  different  practical  situations. 

(A)  U  is  exponentially  distributed,  E\U]  —  A-1;  D  is  exponentially  distributed, 
E\D\  -  n~'\  \U, I,  |D, I  are  mutually  independent  sequences  of  independent  random 
variables. 

(B)  U  is  exponential  with  E[L)  -  A-1;  D  is  gamma  dislribuled  with  E\D)  -  (kfi)'';  Var 
(Dl  -  (VaV)~2,  where  k  and  n  represent,  respectively,  the  shape  and  scale  parame¬ 
ter  of  the  distribution.  The  successive  up  and  dowtimes  are  mutually  independent  as 
in  (A). 

(C)  U  is  exponential  with  E(U)  -  A1;  D  is  Weibull  distributed  with  the  scale  and  shape 
parameters  y  and  5,  respectively,  given  by  the  following  density: 

(11)  fD(d)  -  /6r*-|e-'1"’‘  ;  t  Js  0. 

The  successive  up  and  downtimes  are  independently  distributed. 

(D)  U  is  exponential  with  EMJ )  —  A'1;  D  is  lognormally  distributed  with  parameters  n 
and  <r,  given  by  the  following  density: 

(12)  fD(d)  -  -7= — exp(-l/2(lflrf  - /a)Vcr2];  d  >  0. 

yl2ir<rd 
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The  successive  up  and  downtimes  are  independently  distributed. 

(E)  U  is  represented  by  a  long-tailed  It  distribution,  see  Gaver  and  Chu  (2).  For  a  fixed 
value  of  the  tail-stretch  parameter  /»,  (0  <  h  <  1),  represent  an  uptime  as 


A 

where  X  is  exponentially  distributed  with  £[,T]  =  1.  The  sequence  { £/,}  is  one  of 
independent  random  variables,  themselves  independent  of  (£)J.  The  downtimes  are 
here  assumed  independent  and  exponential. 

In  case  (A),  the  ratio  U/D  is  proportional  to  a  statistic  having  the  F  distribution  with 
(2 n,  2 n)  degrees  of  freedom.  Thus,  an  exact  confidence  interval  for  the  availability  parameter 
A  can  be  obtained  in  this  situation  with  the  help  of  the  F-tables.  The  confidence  intervals 
using  the  F-statistic  were  also  computed. 

3.  NUMERICAL  COMPARISONS 

In  this  section  we  compare  the  different  estimators  using  the  three  criteria  described 
above  for  the  probability  models  (A)  to  (E).  1,000  Monte  Carlo  realizations  were  used  to 
obtain  the  estimates  of  these  performance  measures.  As  far  as  practicable,  the  same  random 
numbers  were  used  for  the  purpose  of  the  comparison.  We  examined  the  case  where  n  —  15. 
Table  1  gives  the  properties  of  the  confidence  intervals  obtained  from  (9).  Table  2  gives  the 
properties  of  the  point  estimates  using  the  PC  criterion  of  equation  (10). 

TABLE  1  —  Simulation  Results'  (1000  Runs)  Comparing  the  Two-sided  95%  Confidence 
Intervals  of  Carious  Estimators  of  Availability;  n  —  15,  t  =*  2.145 
( True  Value  of  Availability  =  0.9901). 


Underlying 

Distributions 

Coverage 

(%) 

Average  Length 

Variance  of 
Length 

A.  (exponential. 

F:95  4 

1.65  x  10~2 

3.65  x  10-5 

exponential) 

JK.LL:94.5 

1.87  x  10-2 

8.00  x  10- 5 

A  -  0.01.  -  1 

JK.UMVU:92.3 

1.54  x  10~2 

4.59  x  lO’5 

JK,MLE:93.2 

1.64  x  10-2 

5.02  x  10~5 

B  (exponential. 

F:98.4 

1.67  x  I0-2 

2.52  x  10-5 

gamma) 

JK,LL:94.7 

1.40  x  I0-2 

3.21  x  I0~5 

A  -  0.01 ,  n  -  3.0 

JK.UMVU:93.4 

1.29  x  10-2 

2  64  x  Itr- 

k  -  1/3 

JK.MLE94.8 

1.38  x  I0-2 

2.89  x  lO'5 

C.  (exponential. 

F:98.0 

1.67  x  I0‘2 

2.65  x  10- 5 

Weibull) 

JK.LL94.3 

1.39  x  lO'2 

3  24  x  lO”5 

A  -  0.01,  r  —  113 

JK.UMVU.92.5 

1  29  x  10~2 

2,71  x  !0-5 

8  -  2.0 

JK.MLE940 

1.38  x  I0~2 

2.98  x  lO"5 

D  (exponential. 

F:92  7 

1  68  x  I0'2 

5.30  x  lO*5 

lognormal) 

JK.LL94.9 

2  28  x  lO’2 

4.37  x  lO*4 

A  -  0.01,  n  -  0.50, 

JK.UMVU:92.5 

1  66  x  |0-2 

8  86  x  lO"5 

<r  -  1.0 

JK.MLE89  8 

1  76  x  lO'2 

9.53  x  lO"5 

E.  (long-tailed  h. 

F882 

1.77  x  JO"2 

6.74  x  |(T5 

exponential) 

JK.LL93.4 

2.41  x  lO'2 

1.60  x  ir4 

A  -  0.01,  M  -  1. 

JK.UMVU  92  7 

1.91  x  10"2 

8.48  x  I0-* 

h  -  0.2 

JK.MLE:94.0 

2.01  x  I0’2 

9.18  x  10-5 

<*F:  F-statistic;  JK.LL  "Log-Logistic  Jackknife;'  JK.UMVU:  "Jackknife  UMVU;” 
JK.MLE:  "Jackknife  Maximum  Likelihood  Estimate) 
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TABLE  2  —  Simulation  Results  (1000  Runs)  Comparing  the  Probabilities  of 
Concentration  for  Various  Estimates  of  A  vailability;  n  —  15. 

(PC  —  Pr  (| Estimate  -  Availability  <  «)] 


1  Underlying 

j  Distribution 

Estimate 

Probability  of  Concentration  (PC)  in  % 

a  -  0.001 

a  -  0  0025 

a  =  0  005 

a  -  01 

A. 

(exponential. 

‘jmle 

22  9 

53  8 

83  5 

97  4 

exponential) 

5lk,mlt 

20  8 

52.7 

86. 1 

98.3 

A  -  0.01.  -  1 

^umvM 

22  5 

53.9 

860 

98  3  | 

^ik.umwu 

21.5 

52.6 

85  8 

98  2  1 

Pt.  1 1 

224 

53  4 

83.4 

97.7 

B 

(exponential. 

28  1 

611 

890 

98.6 

gamma) 

26.6 

61  6 

91  5 

990 

A  -  0  01,  n  =  3  0 

28.1 

62. 1 

91.7 

99  1  j 

k  -  1/3 

'jjk.umvu 

26.9 

61  9 

91.3 

99.0  1 

Alk.  1 1 

28.1 

61  8 

898 

98.7  1 

C 

(exponential. 

'jm  k 

26.2 

59  9 

89.1 

98.3  | 

Weibull) 

A/k.mlr 

25.8 

59  1 

91.8 

98.9  1 

1 

1 

** 

O 

© 

H 

-er 

26  9 

608 

92  4 

99.0  j 

| 

8  -  2  0 

'i/k.umvu 

26.1 

59  3 

91  6 

98.9  | 

| 

Ajk.  1 1 

26  0 

59.4 

90  5 

98  4  | 

P 

(exponential. 

‘imle 

205 

47.1 

82  5 

95  1  [ 

lognormal) 

)lk.mlr 

19.9 

47.4 

83  3 

966  | 

A  -  0  01,  M  -  0  50 

'jumvu 

19.1 

469 

84.6 

964 

| 

O 

II 

b 

A 

‘fjk.umvu 

20.4 

47.4 

836 

966 

Aik.  1 1 

20.0 

47.2 

806 

94  6 

E 

(long-tailed  h. 

Afnlt 

16  5 

42  6 

73.5 

92  8 

exponential) 

Alk.mle 

152 

39  2 

70  8 

94.5 

K  -  0.01 ,  n  -  1 

17.0 

43.4 

749 

95.1 

6  -  0  2 

A 

'fik.umvu 

15  5 

40.0 

71  1 

94  1  | 

_ 

_4ti _ 

16.5 

40  7 

72.1 

93.0  | 

Table  1  shows  that  the  log-logistic  jackknife  estimate  provides  approximate  nominal  cover¬ 
age  of  the  true  parameter  value  under  a  wide  variety  of  situations.  In  comparison,  the  jack¬ 
knifed  uniformly  variance  unbiased  estimate  or  the  jackknifed  maximum  likelihood  estimate 
does  not  fare  as  well.  However,  when  the  probability  of  concentration  of  the  point  estimate  is 
considered,  the  uniformly  minimum  variance  unbiased  estimate  performs  very  well.  In  many 
cases,  it  performs  better  than  the  maximum  likelihood  estimate  or  the  jackknife  log-logistic 
estimate.  These  findings  are  not  surprising  in  view  of  the  more  pronounced  locally  linear 
characteristic  of  the  log-logistic  estimate. 

4.  SUMMARY  AND  CONCLUSIONS 

Several  competing  estimates  of  availability  of  a  system  which  alternates  between  two 
states  —  "up"  and  "down"  have  been  compared.  The  data  is  assumed  to  consist  of  n  complete 
cycles  of  successive  up  and  downtimes.  Monte  Carlo  simulations  carried  on  several  special  but 
representative  situations  indicate  the  general  validity  and  robustness  of  the  log-logistic  jackknife 
estimate  of  Gaver  and  Chu.  However,  when  point  estimates  are  considered,  from  the  point  of 
view  of  probability  of  concentration,  the  uniformly  minimum  variance  unbiased  estimate  for 
the  Markov  case  performs  very  well  in  these  situations. 
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ABSTRACT 

In  this  paper  we  consider  a  simple  three-order-statislic  asymptotically  un¬ 
biased  estimator  of  the  Weibull  shape  parameter  c  for  the  case  in  which  all 
three  parameters  are  unknown.  Optimal  quantiles  that  minimize  the  asymptotic 
variance  of  this  estimator.  £  are  determined  and  shown  to  depend  only  on  the 
true  (unknown)  shape  parameter  value  c  and  in  a  rather  insensitive  way. 
Monte  Carlo  studies  further  verified  that,  in  practice  where  the  true  shape 
parameter  c  is  unknown,  using  always  c  with  the  optimal  quantities  that 
correspond  to  c  —  2  0  produces  estimates,  £*,  remarkably  close  to  the  theoreti¬ 
cal  optimal.  A  second  stage  estimation  procedure,  namely  recalculating  chased 
on  the  optimal  quantiles  corresponding  to  £*,  was  not  worth  the  additional 
effort.  Benchmark  simulation  comparisons  were  also  made  with  the  best  per¬ 
centile  estimator  of  Zanakis  [201  and  with  a  new  estimator  of  Wyckoff.  Bain 
and  Engelhard)  (181,  one  that  appears  to  be  the  best  of  proposed  closed-form 
estimators  but  uses  all  sample  observations. 

The  proposed  estimator,  £*,  should  be  of  interest  to  practitioners  having 
limited  resources  and  to  researchers  as  a  starting  point  for  more  accurate  itera¬ 
tive  estimation  procedures.  Its  form  is  independent  of  all  three  Weibull  param¬ 
eters  and,  for  not  too  large  sample  sizes,  it  requires  the  first,  last  and  only  one 
other  (early)  ordered  observation.  Practical  guidelines  are  provided  for  choos¬ 
ing  the  best  anticipated  estimator  of  shape  for  a  three-parameter  Weibull  distri¬ 
bution  under  different  circumstances. 


I.  INTRODUCTION 

The  cumulative  distribution  function  of  a  three-parameter  Weibull  variate  AT  is  given  by: 
(1)  Fix )  -  1  -  exp(-[(x  -  a)/b]c)  x  ^  a 

•Research  performed  by  this  author  was  supported  by  the  US  Office  of  Naval  Research  under  Contract  No.  N000I4- 
76-C-0723. 
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where  "a"  is  the  threshold,  or  location  parameter.  " b  "  is  the  scale  parameter  and  V"  deter¬ 
mines  the  shape  of  the  distribution.  See  Johnson  and  Kotz,  |7]  and  Mann.  Schafer,  Sing- 
purwalla  [11)  for  comprehensive  information  about  this  distribution  and  its  applications. 

Maximum  likelihood  estimates  (MLEs)  of  all  three  parameters  have  desirable  asymptotic- 
properties.  However,  since  no  closed-form  expression  for  the  MLEs  exists,  the  use  of  a 
numerical  iterative  procedure  is  necessary  to  solve,  for  a  given  sample,  the  corresponding  non¬ 
linear  optimization  problem.  This  is  a  computationally  difficult  problem  for  which  many  algo¬ 
rithms  fail  to  provide  solutions  [19].  Other  deficiencies  of  the  maximum-likelihood  procedure 
as  it  applies  to  this  three-parameter  distribution  are  detailed  in  [16], 

Practitioners  having  limited  time  and  access  to  analysts  or  computing  facilities  would  thus 
prefer  using  closed-form  (analytic)  estimators.  However,  only  rather  inefficient  estimators  cf 
this  type  for  the  three-parameter  Weibull  distribution  have  been  available  in  the  past  [11], 
Recently,  more  than  seventeen  closed-form  estimators  for  the  three  Weibull  parameters  were 
compared  by  Zanakis,  and  the  mcst  efficient  ones  identified  [20], 

Of  the  three  Weibull  parameters,  the  shape  parameter,  c,  is  particularly  important  because 
it  affects  the  shape  of  reliability  and  hazard  rate  curves  (see  [7],  [12]).  and  is  the  primary  cause 
of  computational  difficulties  and  estimation  errors  in  MLE  problems  [10],  [21] 

Here,  we  examine  an  improved  simple  percentile  estimator,  r,  of  the  shape  parameter, 
with  quantiles  determined  optimally  and  independently  of  the  other  two  Weibull  parameters. 

2.  A  PERCENTILE  ESTIMATOR  OF  SHAPE  e  WHICH  IS  INDEPENDENT 
OF  THE  LOCATION  AND  SCALE  PARAMETERS. 

Let  r,  <  /2  <  . . .  ^  /„  be  the  ordered  observations  from  a  random  sample  of  size  n  from 

(1). 


For  the  two  parameter  Weibull  (when  the  location  parameter  is  known  or  zero)  Dubey  [3] 
proposed  a  two  percentile  (p,  <  pk)  estimator  for  the  shape  parameter  c: 

(2)  c  =  ln(ln(l  -  /zA. )/ln(  1  -  p,)\l\n\tkl  t,) 

with  i,  =  /(^ |+i  and  tk  =  t\„Pki+\.  \np,)  being  the  greatest  integer  less  than  or  equal  to  npr 

He  showed  (as  was  shown  earlier  by  Lieblein  [9]  in  investigating  the  extreme-value  distri¬ 
bution)  that  its  asymptotic  variance  is  minimized  when 

(3)  p,  —  0.16731  and  pk  —  0.97366. 

Murthy  and  Swartz  [13]  derived  unbiasing  factors  and  hypothesis  tests  for  c.  Recently,  Bryson 
[1]  argued  that  the  asymptotic  efficiency  of  r  is  not  especially  sensitive  to  small  deviations  from 
the  optimal  percentiles. 

In  a  recent  paper  involving  simple  estimators  for  all  three  Weibull  parameters  [20],  five 
estimators  of  c  were  compared.  These  included  the  following  proposed  simple  percentile  esti¬ 
mator, 

(4)  c  =  0.5ln[ln(l  -  )/ln(  1  -  /?, ) ]/ln [(/*  -  tj)/(tj  -  /,)] 
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where  p,.  pr  pk  are  three  quantiles  satisfying 

(5)  0  <  p,  <  p,  <  pk  ^  1 

(6)  — In ( 1  —  Pj)  =  {ln(l  —  p,)  •  In ( 1  —  pk)Yn 

and  t,.  ir  ik  (/,  <  i,  <  t k )  are  the  corresponding  three  ordered  -.ample  observations.  (See  also 
Johnson  and  Kotz  [71.) 

This  estimator  requires  only  three  sample  observations  and  is  independent  of  the  other 
two  Weibull  parameters.  Thus,  when  all  three  parameters  are  unknown,  estimator  c  is  appeal¬ 
ing  if  the  three  percentiles  can  be  selected  in  some  optimal  way.  It  was  shown  before  [201  and 
verified  here  that  using  (3)  and  (4),  i.e., 

<7)  c2  =  1 .494/ln [( /*  -  r,)]/ft  -  /, )  1 

produces  a  rather  poor  behavior. 

The  purpose  of  this  paper  is  to  derive  optimal  (or  nearly  optimal)  quantiles  p,  and  pk  (p, 
then  obtained  from  (6))  for  c  when  all  three  Weibull  parameters  are  unknown  and  to  compare 
the  estimators  based  on  these  optimal  quantiles  with  certain  othcs  which  also  require  no  com¬ 
plicated  iterative  procedures.  Such  simple  estimators  can  be  used  to  obtain  final  estimates  or  as 
initial  estimates  for  iterative  procedures. 


3.  OPTIMAL  QUANTILES  FOR  c 


With  the  use  of  a  theorem  of  Mosteller  [12]  and  a  lemma  of  Rao  [14],  the  following  ana¬ 
lytic  expression  for  the  asymptotic  variance  of  c  was  obtained: 

12 

1  I  If/.  I  /I  I#  I.  —  fl.ll 

(8)  <t>(p,.pk.c )  = 


1 

f,„ 

ft 

7 

ft  -  Qi  2 

4  nc2 

I'" 

ft 

/ 

in 

ft  -  ft 

R, 

1  2 

i  ,  i 

+  ^ 

ft1- '(ft  -  ft) 

gt1  (ft  -  ft>  ftr' 

ft  -  a  a  -  a 

ftr1  (ft  -  ft) 

Rj 

i  ,  i 

i 

1  ,  1  ] 

2 

ftr' 

a  -  a  ft*  -  ft, 

ftr1 

a  -  a  '  ft*  -  a 

ftr'(ft*  -  ft) 

+  — ^ _ J - 

ftr'ftr'  <G*-ft>2j 

where 

R;  =  pj  (1  ~  Ps »  s  =  i.j.k 

and 

ft,  =  [— In ( 1  -  ps)Yu  s  =  i.j.k 


so  that  from  <6) 


Qj=  iQ>Qk)U2- 

Thus,  for  a  given  sample  size  n ,  this  asymptotic  variance  is  a  function  of  the  true  shape  param¬ 
eter  cand  only  two  of  the  three  quantiles. 
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A  pattern  search  procedure  16],  extended  by  Zanakis  [21 J  for  bounded-variable  nonlinear 
optimization  problems  with  or  without  derivatives,  was  employed  to  obtain,  for  selected  values 
of  the  shape  parameter  c ,  the  quantiles  p,  and  pk  that  minimize  <t>  ( p ,,  pk,  c)/n.  The  results  are 
summarized  in  Table  I  along  with  the  function  values  at  the  point  given  by  (3). 


TABLE  1  —  Optimal  Results  for  c 


True  Shape 
Parameter 

c 

Optimal  Values 

Non-Optimal* 

iv<t>(pj,pk,c) 

Asymptotic 

Inefficiency 

4>{p’,pfc)/<t>(p,,pk,e) 

A 

& 

a‘<t>(Pi,Pk-c) 

HB 

0.0086 

0.230 

2.10 

0.0048 

0.9816 

1.028 

3.194 

3.11 

1.5 

0.0028 

3.155 

12.314 

3.90 

0.0033 

34.976 

3.85 

2.5 

0.0051 

0.9932 

23.215 

81.286 

3.50 

3.0 

0.0072 

0.9939 

51.070 

164.374 

3.22 

0.0092 

K  2  9 

99.545 

300.142 

3.02 

0.0109 

Bsiffff 

176.936 

507.770 

2.87 

0.0124 

B  3?  9 

292.965 

809.210 

2.76 

5.0 

0.0137 

B  5®  ■ 

458.762 

1229.437 

2.68 

7.5 

0.0179 

3 

2522.945 

6194.795 

2.46 

0.0202 

0.9960 

8314.425 

19586.645 

2.36 

»  0.16731  and  p'k  -  0.97366 


These  results  reveal  that  the  asymptotically  optimal  quantiles  (p,,p*)  depend  on  the  true 
value  of  the  shape  parameter  in  a  rather  insensitive  way,  especially  for  pk. 

It  should  be  noted  that  using  instead  (p,,p*)  from  (3),  which  is  optimal  for  the  two- 
parameter  Weibull  distribution,  essentially  triples  the  asymptotic  variance  of  estimator  c  if  all 
three  Weibull  parameters  are  unknown.  This  is  more  pronounced  when  the  true  c  —  1.5  to  2.0, 
as  the  last  column  of  Table  l  indicates.  Note  also,  from  Dubey  [3]  and  Kimball  [8],  that  the 
asymptotic  variance  of  c  (given  by  our  Equation  (2))  when  the  location  parameter,  a,  is  known 
to  be  or  can  be  set  equal  to  zero,  is  0.92  c2/ n.  Thus,  incorrectly  assuming  that  "a"  is  nonzero 
makes  the  estimation  of  c  much  more  difficult  than  necessary. 

4.  MONTE  CARLO  INVESTIGATIONS 

In  order  to  gain  further  insight  into  the  behavior  of  the  percentile  estimator  c,  two  Monte 
Carlo  investigations  were  made.  The  first  investigated  the  sensitivity  of  c  to  small  deviations 
from  the  optimal  quantiles  and  whether  a  simple  iterative  scheme  could  improve  the  perfor¬ 
mance  of  c.  The  second  investigation  compared  c  to  a  very  recently  proposed  efficient  percen¬ 
tile  estimator,  c,  based  on  all  order  statistics  [181  and  the  best  percentile  estimator,  c\  found  in 
our  previous  study  (201. 

Sensitivity  of  l 

Since  the  asymptotically  optimal  quantiles  depend  upon  the  true  (unknown)  shape  param¬ 
eter  c  in  sucfi'  an  insensitive  way,  a  Monte  Carlo  study  with  1,000  replications  was  made  using 
c,  defined  by  (4),  (5),  and  (6),  with  p,  and  pk  corresponding  to  optimal  quantile  values  for 
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c  =  2.0  and  3.0.  Over  the  range  of  investigation,  i.e  ,  c  =  0.5  (0.5)3. 5,  somewhat  better  results 
were  obtained  with  c*  corresponding  to  p,  =  0.0033,  pk  =  0  9920  than  with  one  corresponding 
to  p,  =  0.0072,  pk  =  0.9939.  We  define 

(9)  c*  =  3.643/In [ ( rA.  -  /,)/</,  -  r, )] 

where  t,  =  t|ooo33»l+i.  ' k  =  f|o.W20i.l+o  and  /,  =  /|o  t  is7n|+ 1  Note  that  for  most  sample  sizes  of 
interest,  r,  =  t,  and  tk  =  t„. 

In  this  investigation,  we  also  examined  the  following  two  estimators: 

£0:  defined  by  (4),  (5),  (6)  and  based  on  the  optimal  combination  of  p,  and  pk  from 
Table  1,  although  this  would  not  ordinarily  be  possible  since  the  true  c  value  is  not 
known.  For  this  reason,  we  also  examined  the  following: 

c**:  a  single  iteration  modified  estimator 

(10)  c**  =  £(/>?.  p*k) 

with  p*and  pk  selected  in  a  manner  prescribed  by  the  value  of  c*as  given  in  Table  2. 

Table  2  —  Rule  for  Selection  of  p,  and  pk  in 
Calculation  ofc**  =  c  (p*,p*) 


The  schedule  shown  in  Table  2  for  selection  of  combinations  of  values  of  p*  and  pk  was 
determined  from  results  involving  c0  and  c*.  Because  of  the  negative  bias  of  c*  for  values  of 
c  ^  1.5,  a  correcting  term  (ranging  from  0.1  to  0.7)  was  also  investigated  for  calculating  £**  as 
a  function  of  c*.  This  tended  to  decrease  the  bias  of  c**at  the  expense  of  higher  mean  square 
errors.  Early  simulation  results  also  revealed  that  for  n  <  100,  perfect  choice  of  percentiles 
(those  corresponding  to  the  true  but  unknown  c)  reduces  the  bias  but  not  necessarily  the  MSE, 
particularly  when  the  true  c  is  large.  This  theoretical  estimator,  c0,  has  optimal  asymptotic  pro¬ 
perties  and  is  not  necessarily  good  for  small  n  and  large  c  values. 

Comparison  with  Two  Previous  Percentile  Estimators  c'  and  c 

As  a  benchmark  comparison,  the  proposed  estimators  c*and  c**  were  also  compared  via  a 
similar  Monte  Carlo  study  of  5,000  replications  with  the  two  best  percentile  estimators,  c'  and 
c,  suggested  in  earlier  studies  [20  and  18]. 

The  best  of  the  simple  estimators  for  the  shape  of  a  three-parameter  Weibull  distribution 
examined  in  (20)  was  found  to  be: 
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(11)  c'  =  2.989/ln((/»  -  a)/U,  -  all 
where 

(12)  a  -  (i,i.  -  /22  )/(r,  +  i„  -  2/j) 

was  the  best  estimate  of  the  Weibull  location  parameter  and 
2.989  =  Intlnd  -  0.97366/ln(l  -  0.16731)]. 


WyckofT,  Bain  and  Engelhardt  (18]  recently  extended  their  earlier  estimator  c  [4]  via  the 
following  three  step  procedure: 

First  determine  the  shape  parameter  initial  estimator  (Cl 2.20  in  120]) 

(13)  cn=  2. 989/In (( |+, - /,)/(/!„,, |+1  -  /,)] 

with  ph  pk  given  by  (3),  and  the  location  parameter  estimator 
<14>  a  =  (r,  -  llnVi°)l[  1  -  Mn  <01 

where  t  is  the  sample  mean.  Note  the  similarity  of  estimators  (11)  and  (13).  Then  the 
WyckofT,  Bain,  Engelhardt  estimator  is  given  by: 


(15) 


nkj 


-Yln(r,  -  a)  +  — - —  Y  ln(/r-a) 

r-\  «  ~  S  r-Ttl 


where  s  -  [0.84«  ]  and  the  constant  k„  is  tabulated  in  (4). 


This  estimator  uses  all  order  statistics  and  consequently  it  is  not  surprising  that  it  was 
found  in  [18]  to  have  smaller  bias  and  MSE  than  the  much  simpler  estimator  c\  which  is  based 
only  on  three  quantiles.  However,  c'  performed  better  in  [20]  than  the  earlier  version  of  c  pro¬ 
posed  by  Engelhardt  and  Bain  [41. 


Monte  Carlo  results  comparing  the  proposed  new  estimators  c*  and  c**  with  the  previous 
estimators  c'  and  c  are  shown  in  Table  3.  The  nonoptimal  estimator  c2  given  by  (7)  was  also 
computed  in  the  same  simulation,  but  MSE  values  were  so  large  and  unpredictable  for  large 
values  of  c,  that  no  comparisons  were  necessary. 


TABLE  3  —  Comparison  of  Average  Values  and  Mean  Squared  Errors  of  c' ,c,c*,  and  c** 


Average  Value 

True  c 

MSE 

True  c 

0.50 

1.00 

1  50 

2.00 

2.50 

3.00 

3.50 

0  50 

1  00 

1  50 

2.00 

2.50 

300 

3.50 

?’ 

0.52 

0.99 

1.38 

1.70 

1.95 

2.18 

2.36 

0.008 

0.031 

0.087 

0.227 

0.510 

1.031 

1.551 

c 

0.53 

1  04 

1.52 

1.91 

2.28 

2.60 

287 

0008 

0.035 

0  097 

0191 

0391 

0.705 

1.133 

r* 

0.54 

1.01 

1  43 

1.81 

2  13 

2.45 

2.80 

0  011 

0.038 

0  120 

0.313 

0.673 

1.528 

2.258 

?•* 

0.50 

0.99 

1.44 

181 

2  13 

2.45 

280 

0008 

0053 

0120 

0  310 

0668 

1  523 

2.271 

?* 

052 

1.01 

1  42 

1  77 

205 

2.30 

248 

0.005 

0.020 

0050 

0  143 

0343 

0.714 

1.144 

c 

0.53 

1.06 

1.55 

201 

236 

2.75 

309 

0.005 

0.021 

0.052 

0131 

0.236 

0.392 

0.804 

?• 

0.51 

0  98 

1.41 

1.80 

2  16 

2.52 

283 

0.005 

0.021 

0.067 

0.187 

0417 

0851 

1  644 

0.50 

099 

142 

1.80 

2  16 

2.52 

283 

0  004 

0025 

0  063 

0183 

0.414 

0.848 

1.642 

?’ 

0  50 

099 

1.42 

1.79 

209 

236 

2.58 

0002 

0009 

0.027 

0  087 

0.232 

0.518 

0.991 

c 

0.52 

1  02 

1  52 

1.99 

237 

2  78 

3.13 

0.002 

0.008 

0022 

0059 

0129 

0  240 

0.395 

0  50 

097 

1.43 

1  87 

227 

2  70 

3  11 

0  002 

00)1 

0033 

0093 

0  217 

0434 

0823 

051 

1.00 

1  47 

1  87 

227 

2.70 

3  11 

0002 

0013 

0.026 

0  088 

0215 

0.434 

0.823 

Number  of  replications  -  5000 

Number  of  replications  “  5000 
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Estimator's  c**  slightly  better  performance  over  c*  is  not  considered  worthy  of  the  addi¬ 
tional  effort  required.  Therefore,  the  latter  is  instead  recommended  in  all  cases— especially 
when  the  true  c  is  large,  but  with  caution  if  also  n  $  30 

All  four  estimators  behave  similarly  for  small  values  of  o.  particularly  as  n  increases. 
Comparing  the  proposed  estimator  c*  with  our  earlier  c'  we  see  that  the  former  has  a  smaller 
bias  in  almost  all  cases.  The  MSE  of  c*  is  clearly  smaller  than  that  of  c'  when  n  -  100  and 
c  >  2.50.  In  problems  with  large  sample  sizes  n.  c'  tends  to  have  almost  no  variance  around 
its  expectation;  thus,  for  large  n,  the  MSE  of  c‘  consists  mostly  of  the  bias  squared,  with  almost 
no  contribution  from  its  variance. 

Finally,  the  bias  of  c*  is  about  equal  to  that  of  c,  but  the  MSE  of  c  is  smaller  than  that  of 
c*  when  c  >  2.00.  Our  results  about  cand  c'  are  consistent  with  those  in  (18). 

5.  SOME  LITERATURE  EXAMPLES 

The  estimators  examined  earlier  in  this  paper,  along  with  some  other  percentile  estimators 
used  in  120]  were  applied  to  seven  literature  test  problems,  with  known  shape  parameter.  Prac¬ 
titioners  may  find  the  results  summarized  in  Table  4  particularly  useful  However,  these  results 
should  not  be  generalized  since  they  represent  a  static  picture  to  a  few  examples;  something 
like  setting  your  watch  at  9:00  and  keeping  it  there:  it  will  be  an  excellent  estimate  of  time 
twice  a  day. 


TABLE  4  —  Literature  Test  Problem  Results 


Problem 

1 

2 

3 

4 

5 

6 - 

7 

Source 

15] 

15) 

1)7) 

IP) 

I2J 

1)5) 

115] 

Sample  Size 

40 

40 

too 

1 00 

100 

100 

100 

„  a 

10 

20 

0 

1.975 

0 

5  b 

100 

100 

1 

47  072 

47.072 

H  c 

2 

3 

2 

2 

12 

1.328 

1.328 

Zanakis  c2  -  C8* 

7.697 

2.027 

1  360 

1  577 

0986 

1.546 

1.344* 

a 

Zanakis  C12.2K 

2.950 

3  633 

2.nor 

2.1.39 

1.305 

1  436 

1.600 

CT| 

Zanakis  ?'**C  12.234 

1  801 

2.226 

1.714 

1.747 

1.080 

1.350 

1.377 

Hassanein  C25.23t 

1  502 

1  933 

1  755 

1.154 

1.170 

1  321* 

1  041 

£ 

Engelhard! 

&  Bain  C27.23* 

1.657 

1.962 

1.770 

1.225 

1.175* 

1.262 

1.139 

Wyckoff, 

| 

Bain  St 

A 

Englehardt  c 

2  185 

2  571* 

1.970 

1.870* 

1  280 

1.380 

1,444 

Zanakis-Mann  c* 

2.026* 

1.939 

1  712 

1.575 

1137 

1  231 

1  384 

(p,  -  0.0033,  pk  -  0.9920) 

MLE  for  c 

2.48 

2.330 

1.783 

1.857 

1  201  + 

1  330+ 

1,767 

•Best  percentile  estimate  for  problem. 

tMLE  better  than  best  percentile  estimate  for  problem 

(Notation  used  in  (201. 


It  is  interesting  to  note  that  in  five  out  of  the  seven  problems,  one  or  more  simple  per¬ 
centile  estimator  was  more  accurate  than  the  MLEs  (iteratively  obtained  on  a  computer).  As  it 
was  shown  earlier  (19),  121],  this  occurs  mostly  in  problems  with  small  shape  parameter  values, 
particularly  when  the  sample  size  is  small  — a  fact  of  particular  interest  to  practitioners  having 
limited  resources. 
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6.  CONCLUSIONS 

A  simple,  three-order-statistic  estimator  c*  is  developed  for  estimation  of  the  Weibull 
shape  parameter  c,  when  all  three  parameters  are  unknown.  It  was  found  to  compare  very  well, 
in  terms  of  mean  squared  error,  with  an  estimator  of  the  same  form  based  on  an  optimal  selec¬ 
tion  of  order  statistics,  dependent  only  upon  the  unknown  true  value  of  c.  Compared  to  our 
previous  best  simple  estimator  c'  [201,  c*  has  a  smaller  bias  in  almost  all  cases  and  a  smaller 
MSE  in  problems  with  large  values  of  n  and  c.  For  small  values  of  c.  c *  and  c'  are  as  efficient 
as  the  new  estimator  c  of  Wyckoff,  Bain,  and  Engelhardt  (181  which  is  determined  from  all 
ordered  observations  after  a  single  iteration. 

We  feel  that  all  three  percentile  estimators  c*,  c'  and  c  are  useful  to  practioners  and 
researchers,  depending  on  the  particular  circumstances  as  suggested  in  Figure  1 . 


FltiURE  1  Practical  guidelines  for  choosing  an  estimator  for  the  shape  of 
a  three-parameter  Weibull  distribution 


The  following  comments  will  further  clarify  and  support  our  suggestions: 

If  a  computer  code  is  available,  one  is  naturally  tempted  to  use  Maximum  Likelihood 
Estimation  iterative  procedures.  However,  research  has  shown  that: 
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(i)  MLEs  sometimes  do  not  exist  or  represent  a  local  optimum  [16); 

(ii)  The  shape  parameter  is  the  least  accurate  of  the  three  Weibull  parameters  to  esti¬ 
mate,  causing  computational  difficulties  for  MLEs  as  c  gets  larger  [19); 

(iii)  In  nonregular  cases  ( c  <  2),  the  simple  percentile  estimator  c'  was  found  to  be  more 
accurate  than  the  corresponding  MLE,  particularly  when  n  is  small  [211; 

(iv)  the  choice  of  a  percentile  estimator  like  c\  c*  or  c  as  a  starting  point  may  affect  the 
speed  of  convergence,  but  not  so  much  the  final  shape  parameter  estimate  of  a  good 
iterative  MLE  procedure  [21);  and 

(v)  If  the  sample  size  is  too  large,  determination  of  MLE  or  even  c  is  very  time 
consuming— a  real  concern  in  repetitive  situations. 

So,  even  if  a  computer  is  available,  one  may  use  a  percentile  estimator  initially,  addition¬ 
ally  or  instead. 

It  should  be  noted  that  practitioners  and  students  (usually  nonstatisticians)  often  need  a 
good  estimator  of  the  Weibull  shape  parameter  and  do  not  have  the  time  or  access  to  a  com¬ 
puter  facility,  but  only  a  pocket  calculator.  In  such  cases,  estimator  c—  which  uses  all  order 
statistics— will  be  computationally  prohibitive  for  any  but  extremely  small  sample  sizes.  Instead 
we  recommend  using  c*  if  n  >  50,  c'  otherwise.  The  unusual  advantage  of  c*  is  that  it  is 
independent  of  the  other  Weibull  parameters. 

These  quick  estimators  are  also  well  suited  for  obtaining  point  and  interval  estimates  for 
the  otherwise  unattainable  optimum  solution  to  large  scale  integer,  combinatorial  or  nonconvex 
mathematical  programming  problems  [22].  A  sample  of  heuristic  minimum  solutions  fits  natur¬ 
ally  a  three -parameter  Weibull  distribution. 

In  this  paper  we  also  demonstrated  how  the  use  of  optimization  methods  can  change  an 
erratic  estimator  (c2)  into  a  good  estimator  (c*),  by  specifying  the  appropriate  three  order 
statistics  that  minimize  the  asymptotic  variance  of  this  simple  estimator. 
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ABSTRACT 

Must  Newton-type  methods  for  linearly  constrained  optimisation  be  either 
of  the  modified  Newton  or  quasi-Newton  variety  ’  The  contention  of  this  paper 
is  that  explicity  recomputing  part  of  the  projected  Hessian  may  be  superior  to 
both  approaches.  A  computational  comparison  with  MINOS  is  presented 


1.  INTRODUCTION 

In  a  previous  paper  [10],  we  presented  a  new  algorithm  for  solving  the  linearly  con¬ 
strained  nonlinear  programming  problem: 

Minimize  fix ) 
subject  to  Ax  >  b. 

That  algorithm  is  a  modified  Newton  one,  i.e.,  for  a  problem  involving  n  decision  variables,  an 
entire  matrix  of  projected  second  partial  derivatives,  which  may  be  as  large  as  n  x  n,  must  be 
computed  at  each  iteration.  Such  an  approach,  which  requires  order  n 2  function  evaluations 
and  order  n}  arithmetic  operations  (multiplications  and  divisions)  at  each  iteration,  may  not  be 
practical  when  dealing  with  larger  problems,  say  n  >  20.  This  paper  proposes  a  closely  related 
method,  which  requires  substantially  less  computational  work  pet  iteration,  while  retaining  the 
desirable  features  of  our  previous  algorithm. 

The  chief  novelty  of  our  approach  in  this  paper  is  the  idea  of  explicitly  updating  part  of 
the  second  derivative  matrix  at  each  iteration.  To  illustrate  the  idea,  the  user  chooses  a  "pipe 
width,"  »r,  of  rows  to  update  each  iteration,  and  the  "pipe"  moves  from  upper  left  to  lower  right 
from  iteration  to  iteration.  For  example,  if  the  second  derivative  matrix  is  5  x  5  and  n  -  2,  at 
the  first  iteration  the  updated  elements  (indicated  by  asterisks)  would  be 

*  *  0  0  0 
*•000 
0  0  0  0  9  , 

0  0  0  0  0 

0  0  0  0  0 
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at  the  second  iteration, 

0  0  *  *  0 
0  0  *  •  0 
....  0 
....  0 

0  0  0  0  0 

and  at  the  third  iteration 

*  0  0  0  * 
0  0  0  0  * 
0  0  0  0  * 
0  0  0  0  * 


Broadly  speaking,  Newton-like  methods  fall  into  two  categories— modified  Newton  ones 
and  quasi-Newton  ones.  As  indicated  above,  modified  Newton  methods  update  an  entire 
second  derivative  matrix  at  each  iteration,  while  quasi-Newton  algorithms  iteratively  update  an 
approximated  matrix  by  adding  a  matrix  of  low  rank  (typically  rank  1  or  rank  2)  to  the  last  such 
approximation.  While  modified  Newton  methods  generally  perform  well,  they  are  impractical 
for  larger  problems.  Quasi  Newton  methods  require  less  work  (usually  order  n  function  evalua¬ 
tions  and  order  n 2  arithmetic),  but  are  potentially  subject  to  a  number  of  numerical  shortcom¬ 
ings,  chiefly  due  to  the  second  derivative  approximation  process,  related  to  scaling  the  ele¬ 
ments,  restarts,  and  inability  to  deal  with  regions  of  nonpositive  curvature.  Our  idea  is  to  take 
a  middle  ground  between  those  two  approaches,  by  letting  the  user  decide  how  much  of  the 
second  derivative  information  to  recompute  each  time.  Our  mathematical  analysis  shows  that 
such  an  algorithm  is  a  viable  idea,  and  some  computational  evidence  indicates  that  it  may  yield 
solutions  more  rapidly  than  either  equivalent  modified  or  quasi-Newton  methods. 

The  present  method  inherits  four  m^jor  strengths  from  our  modified  Newton  approach. 

First,  the  theoretical  convergence  results  carry  over  to  the  present  algorithm.  Thus, 
under  relatively  mild  conditions,  the  algorithm  can  be  shown  to  converge  to  a  point  satisfying 
first  order  necessary  optimality  conditions.  Under  somewhat  stronger  conditions,  the  methods 
find  a  point  also  satisfying  second  order  necessary  conditions.  The  rate  of  convergence  is  also 
shown  to  be  superlinear,  that  is  (IU*+I  -  3f||/IU*-  x||)  —  0  as  k  — *  «\  or  of  order  two 
(IU*+1  -  jell  <  c||x*+l  -  3c||2  for  some  c),  depending  on  the  assumptions  made,  where  (**} 
is  the  sequence  of  points  generated  by  the  method  and  x  is  the  optimal  solution. 

Second,  the  method  is  reasonably  easy  to  use.  It  does  not  require  the  user  to  provide 
analytical  derivatives.  However,  while  we  have  had  some  success  in  using  this  approach  on 
problems  whose  derivatives  are  not  smooth  (May,  Shocker  and  Sudharshan  [11]),  the  conver¬ 
gence  proofs  do  assume  such  properties.  Our  computational  experience  indicates  that  the  per¬ 
formance  of  the  method  is  sensitive  to  only  a  few  parameters,  but  will  converge  dependably 
even  with  highly  nonoptimal  parameter  values. 

Third,  the  algorithm  can  deal  directly  with  areas  of  nonpositive  curvature.  When  minim¬ 
izing  a  function,  and  using  a  Newton-like  method,  one  would  like  the  second  derivative  matrix 
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(or  its  projection,  if  appropriate)  to  be  positive  definite,  since  then  the  usual  search  direction 
generation  approach  yields  a  descent  (improvement)  direction.  When  the  second  derivative 
matrix  is  not  positive  definite,  though,  a  modified  approach  is  required  if  we  want  to  be  sure  to 
obtain  a  useful  search  direction.  Now  the  second  derivative  matrix  is  stored  in  a  factorized 
form,  so  it  can  be  efficiently  used  in  the  solution  of  the  set  of  linear  equations  yielding  the 
search  direction.  In  our  modified  Newton  method,  we  utilized  the  modified  LDLr  factorization 
of  Gill  and  Murray  (6],  where  £  is  a  unit  diagonal  lower  triangular  matrix,  and  D  is  a  diagonal 
matrix  with  all  positive  entries.  That  factorization,  when  applied  to  a  nonpositive  definite 
matrix  A ,  will  actually  find  the  factors  of  A  -f  £,  where  E  is  a  diagonal  matrix  with  posilive 
entries,  instead  of  breaking  down,  as  the  usual  LDLT  factorization  would.  In  our  present  algo¬ 
rithm,  the  Gill  and  Murray  LDLT  factorization  could  not  be  used,  since  we  could  not  find  a 
way  of  updating  it  in  order  n2  arithmetic  when  the  second  derivative  matrix  is  modified  by  the 
addition  of  a  rank-1  matrix.  Instead,  we  use  the  factorization  of  Bunch  and  Parlett  [2],  which 
they  denote  QTMDMTQ .  where  Q  is  a  permutation  matrix,  M  unit  diagonal  lower  triangular, 
and  D  block  diagonal,  i.e.,  it  may  have  entries  in  the  first  sub-  and  super-diagonal,  with  only  I 
x  1  and  2x2  blocks  allowed.  (We  will  suppress  the  permutation  matrix  Q  to  simplify  the 
notation.)  Sorenson  [17]  has  shown  how  to  modify  the  MDMT  factorization  when  a  rank  I 
matrix  is  added  to  the  matrix  under  consideration.  While  modified  Newton  methods  often  have 
provisions  lor  dealing  with  the  case  of  nonpositive  definiteness,  using  special  line  search  tech¬ 
niques,  our  method  appears  to  be  the  first  one  which  extends  these  methodologies  to  a  quasi- 
Newton  environment. 

Finally,  the  algorithm  is  a  "least-constrained"  method,  in  the  sense  of  Lenard  [8],  which 
means  it  may  find  the  optimal  constraint  set,  and  thus  converge,  quickly.  Since  our  approach 
uses  an  e-active  constraint  strategy,  it  is  desirable  to  identify  those  constraints  met  with  equality 
at  the  optimal  solution  as  quickly  as  possible.  A  least  constrained  method  allows  for  the  drop¬ 
ping  or  adding  of  several  constraints  from  the  active  set  at  each  iteration,  as  opposed  to  the 
usual  simplex-technique  related  strategy  of  dropping  or  adding  only  one  at  a  time. 


2.  THE  COORDINATE  SYSTEM 


A  basic  idea  in  our  approach  is  the  representation  of  the  locally  feasible  region  in  the 
neighborhood  of  a  point  x  by  a  matrix  factorization  of  the  active  constraint  matrix  N.  That  is, 
the  columns  of  N  are  the  normals  to  the  constraint  considered  active  at  x.  Since  we  always 
assume  that  N  is  full  rank,  for  an  n  x  u  matrix  A,  we  can  find  an  n  x  n  orthogonal  matrix  Q 
and  a  u  x  u  upper  triangular  nonsingular  matrix  R  such  that 


QN  - 


R 

0 


where  0  is  a  matrix  of  zeros. 


Now  the  last  (n  —  u)  rows  of  Q  are  all  orthogonal  to  the  columns  of  A,  which  means  that 
they  define  a  set  of  vectors  spanning  the  linear  manifold  defined  by  the  active  constraints. 
Moving  away  from  x  along  any  such  row  keeps  all  the  active  constraint  active.  Also,  the  gen¬ 
eralized  inverse  of  N,  N+ ,  is  given  by  N+  -  [/?_l0]@.  The  rows  of  N+  have  the  property  that 
movement  away  from  x  along  the  first  row  of  A+,  say,  corresponds  to  dropping  the  first  active 
constraint  while  retaining  all  the  others. 
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The  rows  of  N*.  plus  the  last  (//  -  u)  rows  of  Q ,  constitute  our  coordinate  system  for 
performing  an  iteration  of  the  method.  Since  N  changes  with  any  change  in  the  active  set,  so 
does  the  coordinate  system.  The  method  is  thus  adaptive,  adjusting  itself  to  the  local  structure 
of  the  feasible  region.  This  adaptability,  though,  implies  a  need  for  proper  second  derivative 
matrix  maintenance,  as  outlined  in  Section  5. 

Note  that  by  the  orthogonality  of  Q,  the  two  sets  of  rows  are  orthogonal,  but  that  the 
rows  of  N+  are  not  necessarily  mutually  orthogonal. 

3.  THE  ALGORITHM 

The  algorithm  requires  positive  real  numbers  a./3,y,e,r.  and  p,  with  p  <  I  and 
02  <  p/2p.  a  and  0  are  used  for  move  point  acceptability,  and  p  is  used  for  a  second-order 
Armijo  type  test  for  Newton  point  accep'ability.  An  e-active  constraint  strategy  is  incorporated 
by  letting  any  column  A ,  of  A  satisfying  (A^x  -  b,  <  es  be  considered  active,  y  is  of  the 
order  of  machine  precision,  and  is  introduced  to  avoid  numerical  difficulties,  t  is  required  in 
the  theoretical  development  as  a  lower  bound  on  the  negative  curvature  curve  stepsize  but 
could  be  set  arbitrarily  small.  Also  required  are  an  initial  feasible  point,  .vn.  an  initial  stepsize 
s°  >  0,  an  initial  //-vector  <rn  of  -H’s  and  —  I's,  a  symmetric  matrix  <7°,  and  ir,  the  number  of 
Hessian  rows  /  columns  to  be  updated  each  iteration. 

Let  ,V+  and  Q  be  determined  by  QR  factorization  of  N.  Denote  the  /  th  column  of  [A+]7 
by  n,  and  /  th  column  of  Qr  by  </,,  that  is, 

[Ar+]A=  [»,  //,...  //„  1  and  Qr  «=  \q\  </;...  <7„1. 

Initialize  the  current  solution  point  x  =  ,v°,  the  current  stepsize  s  -  s°,  the  current  direc¬ 
tion  sign  indicator  a  =  o-n,  the  current  matrix  of  QR  approximate  second  partial  derivatives 

G  =  G°,  the  sequence  index  k  =  0.  the  last  updated  column  /  =  //.  and  the  local  variations 

failure  indicator  /  =  0. 

General  Iteration 

Step  1:  (Determine  active  constraints.)  Set  the  square  of  the  gradient  norm  172  =  0. 
Determine  A  for  the  current  x  and  s.  If  the  active  set  has  changed  since  the  last 
execution  of  this  step,  update  Q ,  A+,  and  G. 

Step  2:  (Computer  second  order  multiplier  approximations.)  For  each  constraint  /, 
i  -  1,2,  ....  w,  compute  A/,  an  approximate  Karush-Kuhn-Tucker  multiplier,  by 
evaluating  /  at  two  feasible  points,  a  stepsize  and  a  half  a  stepsize  along  nh  and 

using  a  three  point  forward  difference  approximation.  If,  for  any  0,  A/,  >  0  and 

neither  of  the  two  points  evaluated  along  nt  yields  an  improvement,  dropping  con¬ 
straint  /  would  be  undesirable,  so  set  rf,  -  0.  For  each  /  such  that  A f  <  0  or  one 
of  those  two  points  does  yield  an  improvement,  compute  an  approximate  second 
partial  derivative  A2/,  set  d,  -  IA/-I,  and  let  tj2  -  tj2  +  (A/;)2. 

Let  xB  denote  the  point  with  lowest  objective  function  value  amongst  all  those  evaluated 
at  the  step. 
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Step  3:  (First  and  second  derivatives  for  manifold  directions  explicitly  updated).  Let  q, 
denote  the  row  of  Q  corresponding  to  the  y'th  row  of  G ,  which  is  to  be  explicitly 
updated  this  iteration.  Then  evaluate  f  at  two  points,  one  a  feasible  stepsize  along 
q,  away  from  x  and  the  other  a  feasible  stepsize  along  —  q,  away  from  x.  Using  a 
central  difference  formula,  compute  gt  and  Gjr  approximate  first  and  second  par¬ 
tial  derivatives  along  q,  (the  discrepancy  in  subscripts  is  due  to  the  fact  that  we  are 
interested  in  only  the  last  (n  —  u)  rows  of  Q ).  Set  17 2  =  n2  +  g2.  adding  g ,  to  the 
norm  of  the  (projected)  gradient. 

Update  xB  to  represent  the  better  of  xB  from  the  last  step  and  the  lowest  function  value 
point  evaluated  at  this  step. 

Step  4:  (First  derivatives  for  manifold  directions  not  updated.)  Let  q,  denote  the  row  of  Q 
corresponding  to  the  j  th  row  of  G,  which  is  not  to  be  explicitly  updated  this  itera¬ 
tion.  Then  evaluate  /  at  one  point,  a  feasible  stepsize  along  <r tqt  away  from  x. 
Using  the  value  of  G#  from  the  last  iteration,  and  the  new  function  value,  com¬ 
pute  a  second  order  approximation  to  gj.  Add  g2  to  t)2. 

Update  xB  if  appropriate. 

Step  5:  (Compute  mixed  second  partial  derivatives.)  Consider  each  row  q,  as  in  Step  3. 
Then  for  every  other  qk  corresponding  to  the  /  th  row  of  G,  I  <  j ,  evaluate  / at  a 
feasible  point  appropriately  chosen  along  q,  +  qk>  and  compute  the  mixed  second 
partial  derivative  Gjt  (this  fills  in  the  "pipe"  as  in  Section  1). 

Update  xB  if  appropriate. 

Incorporate  all  the  new  second  derivatives  into  the  matrix  factors  of  G. 

Note  that  (n  -  u)/n  passes  through  this  step,  with  the  same  N,  will  explicitly  update  all 
of  G,  using  —  in  -  u)2  -  (n  -  u)  evaluations.  In  the  worst  case  (m  -  0),  then,  y  n(n  -  1) 
evaluations  are  used,  on  the  average. 

The  factors  of  G  may  be  updated  at  this  point  using  any  rank-one  or  rank-two  formula 
which  does  not  force  positive  definiteness  on  G  on  an  infinite  number  of  iterations. 


Updating  row/column  /  of  G,  for  /  ^  I,  is  equivalent  to  adding  a  rank-two  update  of  the 
form  wej  +  etwT,  where  w/+,  »  ...  -  wn  -  0,  is  the  /th  column  of  the  identity  matrix,  so  that 
if  a  scheme  utilizing  this  special  structure  was  devised,  at  most  n  updating  passes  using 
Sorenson’s  method  [171  would  have  to  be  performed.  We  have  not  yet  devised  a  procedure 
exploiting  this  structure.  Our  code  uses  a  strategy  that  is  reasonably  efficient  when 

7r  <  y  (n  -  w),  and  does  two  rank-one  modifications  for  each  row/column  updated— the 


second  of  which  wipes  out  the  undesired  results  of  the  first.  Using  the  operation  count  in  [181, 
between  2ir(n—  u)2  +  0(n)  and  (1  l/3)ir(n  —  u)2  +  0(n)  operations  are  required  for  this 
updating.  Full  details  on  updating  the  Bunch-Parlett  factorization  are  given  in  [17). 


Step  6:  (Check  accuracy  of  derivative  approximations.)  If  G  is  positive  semidefinite  and 
the  gradient  norm  17  is  sufficiently  small,  stop;  x  is  an  acceptable  solution. 
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If  G  is  positive  definite  but  the  stepsize  is  too  big  relative  to  the  gradient  norm,  i.e.. 
s  >  (17/a),  set  the  failure  indicator  r  =  1  and  go  to  Step  12,  where  the  stepsize  will  be  reduced 
with  no  movement  made.  Otherwise,  scale  the  Newton  search  direction  d  relative  to  the  ,Y~ 
directions.  If  G  is  positive  definite,  go  to  Step  7,  otherwise  to  Step  8. 

Step  7:  (Eigenanalysis  when  Hessian  approximation  is  not  positive  definite.)  Determine  Z 
and  A  such  that  Z  is  orthogonal,  A  is  diagonal,  and  D  =  Z\ZT.  Let  Aw  denote 
the  most  negative  eigenvalue  of  D.  If  Aw  =  0,  D  is  positive  semidefinite;  go  to 
Step  8.  Otherwise,  a  direction  of  strictly  negative  curvature  exists.  Solve 
MTy  =  ZQ  for  it.  If  the  gradient  norm  is  too  small,  go  to  Step  9. 

Step  8:  (Compute  direction  of  line  search.)  Solve  MiD  +  ZEZ1)  MT 8  =  —g  for  8,  where 
£  is  a  diagonal  matrix  given  by  EJt  =  [max{|A„l,  yin  -  u)K.  y)  -  A,,)  for 
j  —  1,2 . n  —  u,  and  A  =  max  | A 1 .  £  is  thus  the  "smallest"  diagonal 

I  ^  ^  n  —  u 

additions  matrix  necessary  to  force  positive  definiteness  on  G.  If  D  is  positive 
definite,  go  to  Step  10  and  search  only  along  the  usual  Newton  direction.  If  D  is 
not  positive  definite,  consider  either  the  "fixed-up  Hessian"  direction 
l/VT'r/  +  Qlh  or  the  quadratic  curve  {.v,l.v,  =  .v  +  t[iN^)Td  -  0»jel  + 
t2UN+)rd  +  0o.vl),  where  Qn  is  the  submatrix  consisting  of  the  last  in  -  it)  rows 
of  0.  Note  that  (A '*)1d  -  is  just  the  negative  gradient  in  QR  coordinates. 
The  second  vector,  iN+)Td  +  Qly\  is  guaranteed  to  be  a  descent  direction  for  /  if 
Aw  <  0  and  the  derivative  approximations  are  sufficiently  accurate.  Our  curve 
search  is  thus  similar  to  that  of  McCormick  [121  and  More  and  Sorenson  [14], 
except  that  we  reverse  the  t  and  t 2  terms.  Our  motivation  for  doing  this  is  that, 
unlike  them,  we  allow  a  stepsize  t  to  exceed  one.  Since  the  gradient  direction  is 
most  useful  only  close  to  the  current  point,  we  wanted  it  to  be  multiplied  by  t 
rather  than  t2. 

If  the  curve  is  to  be  searched,  go  to  Step  9;  if  the  "fixed-up"  Hessian  direction,  go  to  Step 

10. 


Step  9:  (Search  along  a  curve.)  Use  a  one  variable  search  method  to  find  a  ^  r  such 
that  the  point  xf.  satisfies  the  two  term  Armijo-type  acceptability  condition 

/(*,.)  -  fix)  ^  pit* r)2  +  it*)2yTg  +  y(/*)4vrG>'l.  trying  t  -  1  first.  If  success¬ 
ful,  let  xB  =  xr.  In  either  case,  go  to  S  ep  1 1. 

Step  10:(Regular  line  search.)  Search  for  a  t  >  0  such  that  fix  +  t{(\*)Td  +  Qnft\)  ~ 
fix)  <  p[tidTbf  +  8 Tg)  +  y  r28rC],  trying  r-  1  first.  If  successful,  redefine 
xB. 

Stepll:(If  the  move  point  is  good  enough,  reduce  the  stepsize.)  If  fixB,~ 
fix)  >  - a2/32s 2,  there  has  not  been  sufficient  function  value  decrease;  go  to  Step 
12.  If  fixB)  —  fix)  <  — a2fi2s 2,  set  the  local  variations  failure  indicator  r  —  0.  If 
fix b)  -  fix)  <  fiW'  ^ere  has  been,  in  addition,  sufficient  decrease  relative  to 
the  gradient  norm,  so  choose  a  new  smaller  stepsize  s' and  go  to  Step  13.  Other¬ 
wise,  x  —  xB\  go  to  Step  1.  having  moved  to  ab,  but  using  the  same  stepsize  on 
the  next  iteration. 
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Step  1 2 : (Failure;  try  reversing  the  local  variations  directions.)  Reverse  the  cr  direction 
indicators.  If  r  =  0,  this  is  the  first  time  this  step  has  been  encountered  with  the 
current  xand  y,  set  r  —  1  and  go  to  Step  1.  Otherwise,  if  r  =  1,  let  the  new  step- 
size  s'  =  s/2,  set  r  =  0,  xB  —  x,  and  go  to  Step  13. 

Step  13:(Define  a  new  sequence  point.)  If  x  ^  xk.  k  —  k  +  1,  x*  *—  x,  sk  —  s,  x  —  xB. 
and  s  —  s'.  Go  to  Step  1. 

4.  CONVERGENCE 

The  asymptotic  convergence  behavior  of  our  method  is  similar  to  that  of  our  modified 
Newton  method  [10]  from  which  it  inherits  its  coordinate  system  and  searches,  and  our  uncon¬ 
strained  method  [9]  from  which  it  obtains  its  second  derivative  matrix  approximation  approach. 
Proofs  for  the  theorems  below  are  thus  obtained  by  extending  results  from  [9]  in  the  manner  of 
[10],  so  that  we  omit  the  complete  proofs  and  indicate  only  the  key  points  used. 

THEOREM  1:  (First  order  convergence.)  If /is  continuously  differentiable  on  an  open 
convex  set  containing  the  bounded  set  [x\ATx  ^  b ,  fix)  ^  fix0)},  then  the  method  con¬ 
verges  to  a  point  satisfying  the  Karush-Kuhn-Tucker  first  order  necessary  optimality  conditions. 

PROOF:  Since  fix)  is  bounded  from  below,  we  must  have  that  {sk}  is  infinite,  unless  the 
sequence  (x*)  is  finite.  If  so,  we  can  show  that  the  last  point  found  satisfies  the  optimality 
conditions.  Now  assume  (s*)  is  infinite.  Then  [y*]  —  0,  and,  by  the  acceptability  tests  at  Steps 
6,  10,  and  II,  we  have  that  the  gradient  norm  is  bounded  above  by  a  function  of  the  stepsize, 
so  that  the  projected  gradient  norm  also  goes  to  zero. 

The  next  two  results  all  require  that  /  be  twice  continuously  differentiable,  jx*j  converge 
to  a  unique  point  x,  at  which  strictly  complementary  slackness  holds,  and  that  there  is  some 
finite  bound  on  the  norm  of  the  largest  element  of  the  true  projected  Hessian  at  x.  which  we 
denote  H. 

THEOREM  2:  (Second  order  convergence).  If,  in  addition  to  the  above.  V:/H  satisfies 
a  Lipshitz  condition  in  a  neighborhood  of  x,  and  r  is  chosen  small  enough,  x  satisfies  the 
second  order  necessary  optimality  condition  that  H  be  positive  scmidefinite. 

PROOF:  This  result  follows  from  the  explicit  updating  we  do  of  G  If  H  is  not  positive 
semidefinite,  then  U  has  at  least  one  strictly  negative  eigenvalue.  Since  {x*j  —  x,  and,  by  our 
updating,  [G*]  —  H,  eventually  our  curve  search  direction  will  be  sufficiently  close  to  the 
eigenvector  direction  associated  with  that  negative  eigenvalue,  and,  if  r  is  small  enough,  the 
search  at  Step  9  will  be  able  to  be  successful  infinitely  often.  Since  r  is  bounded  away  from 
zero,  this  would  imply  that  |/(x*))  —  -«>,  so  that  //must  be  positive  semidefinite.  The  need 
for  a  "sufficiently  small  r”  is  due  to  the  fact  that  the  upper  limit  of  the  interval  over  which  t* 
may  be  chosen  to  satisfy  the  acceptability  test  at  Step  9  is  a -function  of  the  magnitude  of  the 
most  negative  eigenvalue  of  //,  and  t  must  be  chosen  smaller  than  that  upper  limit  for  the 
proof  to  follow. 

THEOREM  3:  If,  in  addition  to  the  above,  H  is  sufficiently  positive  definite  in  a  feasible 
neighborhood  of  x,  the  convergence  rate  of  the  algorithm  is  superlinear.  If  a  Lipshitz  condition 
also  holds,  order  2  convergence  holds. 
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PROOF:  These  two  results  follow  from  making  our  approximated  Newton  line  search 
direction  computed  at  Step  8  be  sufficiently  close  to  the  true  direction  which  would  be  com¬ 
puted  using  exact  derivatives.  The  stepsize  t  =  I  will  then  be  feasible  and  acceptable,  and.  by 
McCormick  and  Ritter  [13,  Theorem  2!,  the  rates  claimed  follow. 

5.  MODIFYING  THE  SECOND  DERIVATIVE  MATRIX 

The  algorithm  maintains  an  approximation,  G,  to  the  second  derivative  matrix  projected 
onto  the  coordinates  defined  by  the  last  (n  —  u >  rows  of  Q  M  and  D  factors  can  be  main¬ 
tained  in  the  lower  triangle  of  G,  and,  since  G  is  symmetric,  it  is  stored  explicitly  in  the  upper 
triangle  of  the  same  array.  G  can  thus  be  used  to  aid  in  adjusting  the  matrix  when  the  active 
set  changes.  Two  cases  must  be  considered— dropping  a  constraint  from  the  active  set,  and 
adding  one. 

Dropping  a  constraint  from  the  active  set  removes  a  column  from  R  and  a  row  from  A’+, 
and  adds  a  row  and  column  to  G.  An  identity  column  is  added  to  G,  requiring  no  arithmetic. 
(While  this  may  tend  to  imbalance  G,  since  it  will  intermix  a  direction  for  which  no  informa¬ 
tion  is  available  with  those  for  which  much  better  approximations  are  known,  we  compensate 
by  making  the  new  column  the  first  candidate  for  explicit  updating.)  The  amount  of  work 
involved  in  updating  Q  and  R  will  depend  on  the  number  of  constraints  deleted  and  their  rela¬ 
tive  positions  in  the  active  set.  Removing  a  single  constraint  would  require  from  0  (last  con- 

strain!  dropped)  to  3iw  +  —  u2  +  0(n)  multiplications,  3un  +  y  tr  +  0(n)  additions,  and 

(u  —  1)  square  roots  (first  constraint  dropped)  to  update  R ,  and  an  additional 
3[(m  -  2)  (w  -  l)/2l  +  (w  -  1)0(1)  multiplications/additions  for  updating  Q ,  using  the 
arrangement  in  [51. 

Adding  a  constraint  to  the  active  set  adds  a  column  to  R  and  a  row  to  A+,  deflates  G  from 
u  x  u  to  («  -  I )  x  (</  -  I ),  and  modifies  Q  n  multiplications  and  n  additions  are  required  to 
determine  the  new  column  of  R,  and  then  an  (n  —  u)  vector  must  be  reduced  using  orthogonal 
matrices.  Using  straightforward  multiplication,  the  in  —  u  —  1)2  x  2  Givens'  matrices  each 
require  one  square  root,  4  multiplications  and  one  addition.  Premultiplicalion  of  Q  involves 
3/i («  -  u)  -  3n  +  0 </i  -  u)  multiplications  and  additions.  Updating  of  G  is  simplified  by 
using  those  transformations  that  will  zero  out  all  but  the  last  entry  in  the  vector,  as  in  Gill  and 
Murray  [7).  We  would  like  to  determine  triangular  factors,  LLT.  for  G,  since  then  premultipli¬ 
cation  by  the  successive  Givens'  transformations  would  change  L  to  lower  Hessenberg  form, 
and  postmultiplication  would  restore  triangularity. 

Now  G  is  recurred  as  MD\tr .  vi:h  D  consisting  of  1  x  I  and  2x2  blocks,  and  difficulties 
arise  when  a  1  «  I  block  is  nonpositive  or  whenever  a  2  x  2  block  exists,  that  is,  whenever 
there  is  a  nonpositive  eigenvalue  Fortunately,  a  fix  up  of  G  to  be  positive  definite,  so  as  to 
have  LL1  factors,  is  easily  done,  since  whenever  G  is  not  positive  definite  at  this  step,  the  com¬ 
plete  eigensystem  of  D  has  already  been  determined  during  the  previous  execution  of  Step  7. 
Note  that  this  modification  of  G  would  have  to  be  performed  once,  before  each  set  of  con¬ 
straints  is  added,  and  it  will  tend  to  modify  the  negative  curvature  information  we  try  to  main¬ 
tain.  Again,  a  least-constrained  approach  should  tend  to  minimize  such  modifications. 

The  method  used  follows  ideas  in  Sorenson  [17],  and  is  an  attempt  to  replace  G  with  the 
"nearest"  positive  definite  matrix,  i.e.,  G  —  MDMT  is  replaced  with  G  -  M(D  +  ZEZT)MT. 
The  exact  effect  on  the  second  derivative  information  in  G  is  not  clear,  but,  since  we  were  not 
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able  lo  derive  any  scheme  for  preventing  the  indefiniteness  of  G  which  would  be  more  efficient 
than  an  entire  refactorization,  we  opted  for  the  fix-up  approach.  Note  that  this  fix  up  would  be 
exactly  what  we  would  have  obtained  if  we  had  used  the  Bunch-Parlett  factorization  and  forced 
the  projected  Hessian  approximation  to  be  positive  definite. 

The  fix-up  of  G  and  determination  of  the  LLT  factor  requires  between 
—  (n  —  u)  (n  —  u  -  I)  multiplications,  no  additions,  and  (n  —  u)  square  roots  in  the  case  of 

all  1  x  1  blocks,  to  ( n  —  u )2  +  4 (n  -  u)  multiplications,  ( n  -  u)2  +  (n  —  u  —  6)  additions, 

and  (n  —  u)  square  roots  in  the  case  of  all  2  x  2  blocks.  The  subsequent  straightforward  mul¬ 
tiplication  of  the  lower  triangular  factor  and  its  postmultiplication  to  regain  triangularity 

involves  4 («  —  u)2  —  (n  —  u  +  2)  multiplications  and  2(n  —  u)2  —  (n  —  u)  additions.  It 

should  be  noted  that  the  reduction  of  the  (n  -  u)  vector 


V| 

v2 


to  1 1 v | \e„_u  by  successive  premultiplication  by  (n  -  u)  2  x  2  Givens'  matrices  of  the  form 


could  be  replaced  with  a  single  multiplication  by  the  matrix 


pn-u  pn-u- 1  p3fj2  _ 
*n~u~  1»  *n-u~ 2  •••  '  2‘  1 


G 

*1 

0 

0 

0 

s,c2 

-c,c2 

*2 

0 

0 

s,s2c3 

-C,S2C3 

-c2c3 

*3 

0 

S|S2S3C4 

— C|S2S3C4 

-c2s3c4 

-c3c4 

s4 

which  is  a  special  lower  Hessenberg  matrix  in  the  sense  of  [5],  Thus,  savings  of  approximately 
25%  of  the  work  involved  in  premultiplying  the  lower  triangular  factor  of  G  could  be  saved  by 
using  Lemma  IV  of  Section  2  of  [5]. 

If  constraints  have  been  added  to  the  active  set,  N+  increases.  Due  to  the  complexity  of 
updating  V+,  especially  since  several  constraints  are  often  added  at  the  same  time,  N+  is 

recomputed  at  such  times.  Recomputation  of  /V+  requires  ~u2n  multiplications  and 
y  u(u  —  1  )n  additions. 
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6.  COMPUTATIONAL  RESULTS 

The  essential  motivating  factor  in  this  research  was  our  feeling  that  a  partial  explicit 
update  of  the  second  derivative  matrix  might  prove  to  be  superior  to  the  usual  quasi-Newton 
strategy.  We  coded  our  method  to  see  whether  numerical  experimentation  would  be  consistent 
with  that  conjecture.  (A  copy  of  that  code  may  be  obtained  from  the  author.) 

There  are  actually  two  different  major  hypotheses  to  test  here.  First,  we  wanted  to  see  if 
varying  ir,  the  number  of  rows/columns  in  the  update  "pipe,"  could  improve  convergence 
behavior.  We  tested  this  using  different  values  of  n  in  our  new  code,  and  by  comparison  with 
our  older  modified  Newton  method,  thus  holding  the  algorithmic  structure  constant.  Second, 
we  needed  to  see  whether  our  new  method  is  competitive  with  other  current  software.  For  this 
comparison,  we  used  a  state-of-the-art  quasi-Newton  method,  Murtagh  and  Saunder's  MINOS 
[15]  algorithm.  On  a  secondary  level,  we  were  curious  as  to  how  the  inclusion  of  a  low  rank 
update  might  affect  the  results,  so  we  also  ran  the  test  problems  using  Broyden's  rank-1  update 
and  the  complementary  DFP  rank-2  update.  Note  that  the  latter  forces  positive  definiteness  on 
G. 


As  with  similar  numerical  experimentation  with  optimization  codes,  our  results  cannot  be 
considered  an  absolute  conclusion  for  either  hypothesis.  The  test  results  using  our  methods  are 
affected  by  the  parameter  settings,  particularly  the  starting  stepsize.  While  we  tried  to  find  good 
values  for  each  instance,  other  parameter  settings  might  have  been  optimal.  The  path  our  algo¬ 
rithm  took  to  the  solution  may  differ  with  tt,  so  that  the  results  may  be  an  artifact  of  the  start¬ 
ing  point.  Finally,  we  used  the  default  parameters  in  MINOS  for  runs  through  it.  A  more 
experienced  MINOS  user  might  have  been  able  to  speed  its  convergence  by  setting  its  various 
parameters  differently.  Note  that  our  stopping  criterion,  since  it  is  a  second  order  one,  is  more 
stringent  than  that  of  MINOS. 

These  caveats  aside,  our  experiments  do  tend  to  support  our  conjectures.  Table  1  sum¬ 
marizes  the  number  of  function  evaluations  used  until  convergence  (i.e.,  agreement  with  the 
optimal  /  value  to  10  decimal  places)  for  the  best  configuration  of  our  new  method,  our 
modified  Newton  method,  and  MINOS  for  six  popular  test  problems.  The  results  are 


TABLE  1  —  Comparative  Function  Evaluation  Counts 


Problem  and 

Best  Form  of  the  New  Method 

Our  Modified 

Newton 

Method 

MINOS 

Source 

IT 

Update 

Evaluations 

Gauthier  [3] 
(Colville  #7) 

3 

Rank  2 

111 

187 

496 

Hydrazine  Equil¬ 
ibrium  (4) 

H 

None 

213 

223 

561 

Rosenbrock  [16] 

None 

109  [9] 

97 

Shell  Primal  [3] 
(Colville  #1) 

0 

None 

44 

50 

Water  Quality  [10] 

2 

None 

271 

480 

Wood  (31 
(Colville  #4) 

3 

None 

245  [9] 

447 

_ 
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particularly  striking  for  the  Water  Quality  problem,  which  is  a  nonconvex  1 1  variable  model 
derived  from  a  model  of  the  Willmette  River.  The  new  method  performed  about  the  same  as 
our  modified  Newton  one  on  Hydrazine  Equilibrium,  Rosenbrock's  banana  function,  and  Shell 
Primal,  and  did  much  better  on  the  other  three. 

To  further  see  the  effect  of  modifying  it,  Figure  1  shows,  graphically,  the  number  of 
function  evaluations  required  for  convergence  on  the  four  linearly  constrained  problems.  Shell 
Primal  is  a  relatively  easy  cubic  problem,  and  Figure  1A  shows  that  not  much  benefit  results 
from  either  increasing  it  or  adding  a  low  rank  update.  The  best  performance  occurred  for 
it  —  0  and  no  update;  since  we  start  with  an  identity  matrix  for  G  this  is  nothing  more  than 
steepest  descent. 

Gauthier’s  problem  is  a  16  variable  quartic.  In  Figure  IB  the  convex  pattern  with  respect 
to  it  that  was  so  marked  for  unconstrained  problems  [9]  appears  for  the  "no  update"  and  "rank- 
2  update"  curves.  That  is,  an  intermediate  value  of  it  is  superior  to  either  the  modified  Newton 
(it  —  n)  or  quasi-Newton  (it  —  0)  strategy.  The  pattern  for  the  rank-1  update  is  different,  con¬ 
founded  in  part  by  local  variations  moves.  Since  the  method  moves  to  the  better  of  the  points 
found  by  line  (or  curve)  search  or  points  used  for  derivative  approximations,  fortuitous  values 
of  the  stepsize  and  coordinate  system  sometimes  lead  to  fast  convergence. 

Hydrazine  equilibrium  is  a  tightly  constrained  10  variable  problem.  The  feasible  region  is 
small,  and  the  method  spends  much  time  changing  constraint  sets  (the  optimum  is  strictly  inte¬ 
rior  to  the  inequality  constraints).  Again,  particularly  lucky  searches  on  local  variation  moves 
had  an  impact  on  the  convergence.  The  pattern  of  Figure  1C  is  a  bit  confusing,  although  the 
modified  Newton-like  strategy  of  larger  jt  seems  to  be  desirable. 

Finally,  the  performance  of  the  Water  Quality  problem  is  illustrated  in  Figure  ID.  Since  a 
rank-1  or  rank-2  update  cannot  be  performed  unless  the  same  constraint  set  is  used  for  two 
consecutive  iterations,  the  constant  changes  of  constraint  sets  meant  that  such  updates  were 
never  performed.  Here  the  "U"  shaped  patterns  appear  again,  lending  support  to  the  idea  that 
some  explicit  updating  is  better  than  none,  but  that  a  complete  update  is  usually  not 
worthwhile.  This  is  also  a  rather  hard  problem;  note  that  MINOS  required  2,292  function 
evaluations  to  converge. 

Table  2  shows  the  relative  percentage  of  time  used  by  each  part  of  our  algorithm.  Our 
code  is  an  experimental  one,  written  by  the  author,  so  that  a  more  skillful  implementation 
might  redistribute  these  somewhat.  Test  runs  were  performed  on  the  University  of  Pittsburgh’s 
DEC  1099  in  the  usual  batch  environment.  In  this  situation,  times  tend  to  vary  by  ±15%. 
The  standardized  times  reported  in  the  last  two  columns  were  computed  relative  to  the  28.48 
seconds  required  for  Colville’s  [3]  timing  routine  (average  of  3  runs). 

The  bulk  of  the  time  required  by  the  new  method  was  for  the  numerical  derivative 
approximations.  This  was  reassuring,  since  a  basic  idea  in  using  a  nonderivative  method,  such 
as  ours,  is  to  trade  off  increased  computer  time  for  the  human  time  necessary  to  compute  the  deriva¬ 
tives  analytically. 

Finally,  the  standardized  time  ratio  for  our  new  method  compares  favorably  with  that  of 
MINOS.  The  percentage  reduction  in  time  for  our  method,  relative  to  MINOS,  ranges  from 
37-70%  for  the  constrained  problems  and  is  about  90%  for  the  unconstrained  ones.  Considered 
together  with  the  additional  time  necessary  to  derive  and  program  the  derivatives  for  a  code 
such  as  MINOS,  we  can  conservatively  conclude  that  our  new  method  is  at  least  competitive 
with  a  representative,  state-of-the-art,  quasi-Newton  algorithm. 
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TABLE  2  —  Timing  Results 
Time  (sec.) 


Problem 

New  Method 

MINOS 

Setup;  Finding 
Initial 

Feasible  Point 

Evaluations 

and 

Derivatives 

Constraint  Handling 
and 

Matrix  Updating 

Searches 

Standardized 

Time 

Ratio  (total) 

Standardized 

Time 

Ratio  (total) 

Gauthier 

28% 

38% 

17% 

17% 

00557 

00885 

Hydrazine 

16% 

75% 

2% 

6% 

0.0331 

0.0572 

Rosen  brock 

- 

— 

— 

— 

0.0015119! 

0.0330 

Shell  Primal 

56% 

28% 

16% 

<  1% 

0.0109 

00372 

Water  Quality 

14% 

61% 

14% 

12% 

0.1131 

0.1921 

Wood 

- 

- 

- 

- 

0.0066119) 

0.0509 

7.  CONCLUSIONS 

Our  thesis,  in  this  paper,  is  that  explicitly  updating  a  few  columns  of  the  approximated 
(projected)  matrix  of  second  partial  derivatives  at  each  iteration  may  yield  convergence 
behavior  superior  to  that  of  traditional  quasi-Newton  algorithms.  We  presented  a  method  for 
linearly  constrained  optimization,  incorporating  that  idea  into  our  existing  algorithmic  frame¬ 
work.  The  user  specifies  the  number  of  rows/column  to  be  updated  at  each  iteration.  Theoret¬ 
ical  convergence  proofs  show  that  such  a  method  retains  the  properties  of  our  modified  Newton 
algorithm— convergence  to  a  second-order  Kuhn-Tucker  point,  and  superlinear  rate.  Numerical 
experiments  indicate  that  this  approach  is  competitive  with  a  state-of-the-art  quasi-Newton 
code,  and  that  a  small  number  of  explicitly  updated  rows/columns  (usually  2  or  3)  can 
significantly  improve  the  speed  of  convergence. 
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ABSTRACT 

In  this  paper,  we  develop  efficient  interactive  methods  Tor  the  solution  of 
bicriteria  nonlinear  programming  problems.  The  methods  do  not  require 
trade-off  information  from  the  decision  maker,  pose  less  cognitive  burden  and 
converge  to  the  "best  compromise  solution"  fast  Two  methods,  called  the 
paired  comparison  method  and  comparative  trade-oflf  method,  are  presented 
with  examples.  A  real  application  of  the  interactive  method  to  a  bicriteria  prob¬ 
lem  that  arose  in  the  planning  of  the  cardiovascular  disease  control  program  in 
the  U.S.  Air  Force  is  also  presented. 


INTRODUCTION 

Bicriteria  mathematical  programming  (BCMP)  problems  as  a  first  generalization  of  single 
criterion  nonlinear  programs  have  been  studied  by  many  researchers  [I,  2,  3,  4,  7,  13,  17J. 
The  small  dimensionality  of  the  problem  permits  a  visual  presentation  of  the  "payoff  set,"  which 
partly  explains  such  widespread  interest.  Further,  when  we  restrict  ourselves  to  "efficient  solu¬ 
tions,"  the  complementary  roles  of  the  two  criteria  yields  additional  computational  power. 

The  BCMP  problem  may  be  stated  as  follows: 

(I)  VMAX  {/,(*). 

Subject  to  &(x-)  <0  i  —  1 . m 

where  xis  an  n  dimensional  vector  of  decision  variables,  ,/j  and  J2  are  real  valued  criterion  func¬ 
tions  and  /?,'s  represent  a  set  of  nonlinear  constraints.  Let  S  -  (xlft(x)  <  0)  denote  the  feasi¬ 
ble  region.  An  optimal  solution  of  BCMP  will  be  taken  to  be  a  "best  compromise  solution"  that 
maximizes  the  preferences  of  a  Decision  Maker  (DM);  that  is,  a  feasible  solution  that  solves 
the  following  program. 
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(2)  Max  U[f\(x),f2ix)\ 

Subject  to  x  €  S 

where  U  is  the  DM’s  preference  function  or  value  function  defined  over  the  criterion  values 
</ J2)  such  that  for  any  two  alternatives  X\  and  x2 ,  £/[./j(x|),  /2(jC|)]  >  U  [yj  (xi),  /j(x2)).  if 
the  DM  prefers  X|  to  x2. 

In  one  of  the  earlier  papers  addressed  to  the  bicriteria  problem,  Geoffrion  |7]  considered 
the  "scalar  maximization  approach,"  i.e.,  maximizing  a  convex  combination  of  the  two  criteria. 
He  showed  how  such  problems  can  be  numerically  solved  using  parametric  programming  algo¬ 
rithms.  Bacopoulas  and  Singer  [21  on  the  other  hand,  used  the  "constrained  criteria  approach.” 
i.e.,  maximizing  one  criterion  keeping  the  other  criterion  as  a  constraint.  They  showed  how  all 
the  efficient  solutions  can  be  generated  by  parametrically  varying  the  level  of  the  constrained 
criterion  over  a  particular  interval.  Some  further  refinements  and  alternate  proofs  appear  in 
Gearhart  [6)  and  Benson  [31.  Pasternak  and  Passy  [13],  addressed  the  special  case  when  the 
variables  are  zero-one  and  devised  a  special  enumerative  algorithm  for  generating  the  efficient 
solutions.  Choo  and  Atkins  [4]  address  the  case  when  the  criteria  are  linear  fractional  func¬ 
tions.  Adulbhan  [ I  ]  considers  the  special  case  when  the  criteria  are  linear.  In  all  these 
methods,  the  emphasis  is  on  generating  all  the  efficient  solutions  of  BCMP.  But  Walker  [17] 
considers  the  "interactive"  approach  and  uses  the  Generalized  Lagrange  Multiplier  method  of 
Everett  [5].  Walker’s  method  is  slow  in  convergence  and  seeks  difficult  interaction  from  the 
DM. 


In  this  paper  we  will  consider  interactive  approaches  to  the  solution  of  BCMP  where  the 
preference  function  U  is  only  implicitly  known.  Such  an  approach  in  the  more  general  context 
of  multiple  criteria  problems  have  been  addressed  by  Geoffrion,  Dyer  and  Feinberg  [81  and 
Zionis  [191.  However,  Wallenius  H8)  has  observed  that  such  methods  pose  considerable  cogni¬ 
tive  burden  on  the  part  of  the  DM;  also  the  procedures  converge  to  the  optimal  solution  rather 
slowly.  In  this  paper,  we  develop  interactive  procedures  that  pose  less  cognitive  burden  to  the 
DM  and  converge  to  the  optimal  solution  fast. 

Section  2  presents  the  principal  results  that  form  the  basis  of  the  interactive  methods 
developed  for  BCMP.  Section  3  discusses  one  of  the  interactive  approaches  called  the  Paired 
Comparison  Method.  Section  4  discusses  another  approach  called  the  Comparative  Trade  offi 
Method.  Section  5  discusses  a  real  application  of  the  interactive  method  to  a  bicriteria  problem 
that  arose  in  the  control  of  cardiovascular  disease  in  the  li.S.  Air  Force. 

2.  PRINCIPAL  RESULTS 

In  this  section,  we  will  establish  a  number  of  results  that  form  the  basis  of  the  new 
interactive  procedures  which  are  develolped  in  the  later  sections.  We  will  assume  throughout 
that  the  feasible  region  S  is  a  compact  convex  set;  the  criterion  functions  f\  and  f2  are 
differentiable  concave  functions  and  the  preference  function  U  is  differentiable,  increasing  and 
strongly  quasiconcave  defined  in  the  following  manner. 

DEFINITION  1:  /:  S  — =>  R\  where  S  is  a  nonempty  convex  set  in  Rn  is  said  to  be 
strongly  quasiconcave  on  S,  if  for  each  X|,  x2  €  5  with  x,  ^  x2  and  for  every  X  €  (0,  1),  we 
have 

(3)  /[Xx|  +  (1  -  \)x2]  >  Min  [f(x\),f{x2)]. 
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The  convexity  assumptions  on  criteria  and  feasible  region  are  made  only  to  allow  the 
methods  of  convex  programming  to  be  applicable.  The  quasiconcavity  assumption  of  L  is  well 
supported  by  preference  theory  (see  Intriligator  [10]). 

Use  of  the  concavity  assumptions  guarantee  the  attainment  of  maxima  and  quasiconcavity 
ensures  us  that  local  optima  are  also  global  optima.  In  addition  it  is  sufficient  to  restrict 
ourselves  to  efficient  solutions  defined  below  as  candidates  for  the  optima. 

DEFINITION  2:  A  solution  x°  €  5  is  said  to  be  efficient  to  BCMP.  if  / ,  (.v )  >  / ,  <  V )  for 
some  x  €  S  =>  /2(x°>  >  /2(x). 

We  will  now  develop  procedures  for  generating  the  efficient  solutions  and  then  a  method 
of  "efficiently"  searching  among  the  efficient  solutions  for  the  "optimal"  solution. 

(4)  Define  the  payoff  set  Y  =  (y|/(x)  =  y  for  x  6  S). 

(5)  Let  A  =  {>’ |/(x)  ^  v  for  x  €  5],  Obviously,  Y  C  A. 

LEMMA  1:  If  S  is  a  convex  set  and  /is  concave,  then  A  will  be  a  convex  set  [12] 

Consider  the  following  single  objective  nonlinear  programs: 

PI:  Max/|(x)  P2:  Max  ,/'2(x) 

Subject  to:  x  €  S  Subject  to:  x  €  S 

Let  the  optimal  values  of  (PI)  and  (P2)  be  v*and  w*  respectively.  Now  consider  the  fol¬ 
lowing  mathematical  programs: 

Py.  Max /2(x)  Qw\  Max,/j(x) 

Subject  to:  x  €  S  Subject  to:  x  €  S 

/|(x)  ^  v  /2(x)  >  H' 

Let  x*  solve  Qw  for  w  =  w*.  Then  ,/j(x2)  is  the  minimum  achievable  value  for  J\ 
without  sacrificing  any  achievement  on  /2,  while  v*is  the  maximum  value  of  ,/j  at  the  expense 
of  f2.  Hence,  the  range  of  achievable  values  for  ./j,  denoted  by  v,  is  given  by  (,/j(x2),  v‘]  and 
the  best  compromise  value  for  f\  lies  in  this  range. 

THEOREM  1:  In  the  optimal  solution  to  the  program  where  v  €  [/j(x2),  v*],  the 
constraint  ,/j(x)  >  v  will  be  a  binding  constraint,  i.e.,  if  x  solves  P 5  then  /1  (3c >  -  v. 

PROOF:  Assume  the  contrary,  i.e  , 

(6)  /,(x)  >  v  >  /,(xf) 

w*  being  the  absolute  maximum  of  f2  over  5, 

(7)  w*  -  /2(xJ)  >  /2(x) 
we  will  consider  the  two  cases 

(8)  (i)  Let  /2(xf)  -  f2(x) 

(6)  and  (7)  taken  together  contradict  the  fact  that  xj  solves  Qt . 
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(9)  (ii)  Lei  ,/2(x2)  >  /2(x) 

consider  the  line  segment  lx  x*l  which  lies  in  the  convex  sei  S.  Over  this  line  segment,  the 
concave  functions  J\  and  f\  are  unimodal.  Because  of  “strict"  inequality  in  (6)  and  (9).  there 
exists  a  feasible  solution  x'  on  the  line  segment  lx,  xji  such  that 

(10)  /,(.v)  >  /,  <x  )  >  v 
,/2(x)  <  /2(x’)  <  /2(xf). 

In  equality  (10)  indicates  that  x'  is  a  feasible  solution  to  P5  with  /2(x')  >  /2(x)  contradicting 
the  optimality  of  x. 

Hence,  /,(x)  =  v. 

THEOREM  2:  x  €  S  is  efficient  if  and  only  if  x  solves  P-  where  v  €  (/,  (x*).  v*]. 

PROOF 

Sufficiency:  Let  x solve  P-  where  v  €  [/|(x2).v*l,  We  will  show  that  x  is  efficient.  Con¬ 
sider  an  x  €  S  such  that 

/i(x)  >  ,/|(x)  =  v. 

It  is  clear  that  x  is  feasible  to  P-  But  x  cannot  be  optimal  to  P-:  as  per  theorem  I  every 
optimal  solution  x°  satifies  /|(x°)  -  v.  Hence,  /2(x)  <  /2(x).  Consider  an  x  6  5 such  that 

/2(x)  >  /2(x). 

Since  x  is  optimal  to  P-,  x  cannot  be  feasible  to  P-.  Hence,  /((x)  <  J\(x).  Consequently,  x 
must  be  efficient. 

Necessity:  Let  £  be  the  set  of  all  efficient  solutions.  Suppose  x  €  £.  Obviously  J\(x)  ^ 
/, (xp  as,  otherwise,  x*  will  dominate  x.  Let  /((x)  —  v.  Assume  that  xdoes  not  solve  P-  but 
some  other  x'  €  5  solves  P-.  By  theorem  1,  /)(x')  —  v  »  /|(x)  since  x'  is  optimal  for  P 
/j(x')  ^  /2(x):  but  x  €  £  — >  /2(x')  <  /2(x)  and  hence  /2(x')  -  /2(x).  Consequently,  x 
solves 

Theorem  2  enables  us  to  generate  the  entire  set  of  efficient  solutions  by  parametrically 
solving  P-.  By  theorem  1,  the  generated  solutions  will  have  specific  levels  of  attainment  of  /,, 
namely  v.  The  next  theorem  would  give  some  unimodality  properly  enabling  us  to  make  an 
efficient  parametrization. 

Consider  the  problem  where  we  maximize  U  over  the  pay-off  set  Y,  keeping  /,  fixed  at  v; 
i.e.. 


P3:  Max  t/(v,/2) 

<v./2)0- 

where  (v,/2)  €  K is  a  subset  of  /defined  by  {(v,/2)|/2  -  /2(x)  for  x  €  Sand  //x)  -  v| 


naval  research  logistics  quarterly 


VOL  29.  NO  3,  SEPTEMBER  1982 


INI  I  R  \(  1 1\  I  HI  (  Kii  I  KIA  M  \  llll  M  \  l  l(  M  I’KOGR  \MS 


447 


By  Theorem  1  and  the  fact  that  U  is  increasing  in  f2,  it  is  easy  to  see  that  an  optimal  solu¬ 
tion  to  (P3)  is  also  an  optimal  solution  to  program  (/\)  where  v  6  [./| <.v?) .  v *] .  We  now  estab¬ 
lish  the  unimodality  of  x'(v),  defined  below: 

THEOREM  3:  Let  Y'  be  a  convex  subset  of  R2  and  U(,f\,J2 )  is  a  strongly  quasiconcave 
increasing  function  of  J]  and  f\  defined  over  then 

,?(v)  =  Max  U(\.f\) 

i\.i2  ie>" 

is  strongly  quasiconcave,  i.e..  unimodal  in  v. 

PROOF:  Let 

#(v,)  =  Max  6/(v;,/'->)  =  6/<v,.w3)  (say) 
s(v4)  =  Max  t/(v4./3)  =  6(v4,h-4>  (say). 

<v4/,ier 

Assuming  boundedness  of  functions  the  maxima  are  attained;  by  convexity  of  }  and  strong 
quasiconcavity  of  U  they  are  attained  uniquely. 

Note  that  (v3,w3)  €  Y'  and  (v4,w4)  €  V  by  convexity  of  Y\ 

(Xv3  +  (1  -  X )v4,  X w,  +  (I  -  X ) m'4]  also  €  Y,  for  0  ^  X  ^  1 . 

XT [v3  +  (1  -  X)v4]  =  Max  £/[Xv3  +  ( I  -  XlVj./il 

(By  convexity  of  Y)  >  i/(Xv3  +  (1  -  X)v4,Xwj  +  (I  -  X>w4) 

(By  strong  quasi-  >  Min  (6Mv3,h-,.),  t/(v4,»-4)] 
concavity  of  60  =  Min  (xf  < v,) ,  xr  (v4)  ] 

Hence,  g(\)  is  unimodal  in  v. 

In  the  proof  of  the  above  theorem,  we  have  heavily  exploited  the  convexity  ol  >  When 
we  want  to  apply  the  result  to  Y,  the  pay-off  set,  we  find  that  the  assumption  of  convexity  of 
pay-off  set  may  be  overly  restrictive.  For  example  our  assumptions  of  convexity  ol  5  and  con¬ 
cavity  of  f/s  are  not  sufficient  to  guarantee  convexity  of  Y.  as  the  following  simple  example 
would  illustrate. 

EXAMPLE:  S  -  x  >  0:  /»  (,/T ,/2 )  =  (x.  -log  x)  Obviously,  5  is  a  convex  set  and  s 
are  concave.  But  the  pay-off  set  Y  is  the  graph  or  -  log  x  which  obviously  is  not  a  convex  set 
112). 

However,  we  will  presently  establish  the  fact  that  we  can  extend  the  pay-off  set  f  to  the  set  .4 
defined  in  Equation  (5)  without  affecting  the  results.  Lemma  I  established  the  convexity  of 
the  set  A.  Hence  identifying  A  with  Y\  the  result  of  theorem  3  holds  even  when  the  pay-off 
set  is  nonconvex. 
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We  will  establish  in  the  next  theorem,  the  equivalence  of  solutions  to  the  following  two 
problems. 

P4.  Max  U(a.f2 )  P5:  Max  f/(o,/2) 

(o./j)?  Y  (a  ./■>)€  4 

THEOREM  4:  The  optimal  solutions  to  (PA)  and  (P 5)  are  identical,  i.e.,  if  Ym  be  the  set 
of  optimal  solutions  to  (P4)  and  Am  be  the  set  of  optimal  solutions  to  (P5).  then  Ym  =  Am. 

PROOF: 

(a)  We  will  first  prove  that  every  element  of  A„  is  an  element  of  Ym  i.e.,  A„  D  Ym. 

Let  y*  =  (y*.yp  maximize  P5  for  a  =  y*  i.e.,  y*  €  Am.  We  have  to  show  that 
v*  6  Y  and  hence  y*  €  Ym.  Since  Am  C  A ,  by  definition  3  and  x *,  such  that 
fix*)  =  (/f./J)  >  y*.  Note  that  y*  €  Y  is  f(x*)—y*.  Assume  that 

fix*)  >  y\  i.e., 

(ID  ./?  >  yf.  ./?  >  y?  <  say). 

Since  U(/\.f2)  is  increasing  in  /|  and  /2,  (11)  gives 

(12)  f/(.vf,yj)  <  U(y*\,fi). 

But  from  the  fact  that  y*  maximizes  d/(>>f./2)  over  (.yf,/2)  6  A  we  get 

(13)  Uiylyp  >  Uiylfp. 

(13)  contradicts  (12);  hence  our  assumption  (11)  must  be  invalid  and  fix*)  -  y*.  By 
definition  of  the  set  Y,  y*  6  Y.  Being  a  maximizer  of  (P5)  for  a  =  y *  y *  6  Ym  and  hence 
Am  3  Ym. 

(b)  Assume  y*  €  Ym.  We  will  show  that  y*  €  Am  as  well  or  Ym  C  Am.  Since 
Y„,  C  Y  c  A,  we  get  y*  €  A.  Assume  y*  is  not  optimal  to  (P5).  Hence  3 
another  v' -  (y*,y2)  such  that 

V  (yf,y2)  -  Max  t/|yf,/2l 

(14)  f/(y*y2)  ^  U(y\,f2)  for  any  (yf,/2)  €  A 
and  in  particular  >  (/(y*,y2). 

Now  (yf,y2)  €  Am ;  by  part  (a)  of  the  proof  we  will  have  (y*  Y2)  €  Y.  By  optimality  of 
(yf.yj)  for  (P4) 

(15)  U (y*,y2)  <  U (y*.y *) 

(14)  and  (15)  imply  that 

U(yly2)  =  U(y*,y2) 

or  in  other  words  (yf.y?)  optimizes  (P5)  as  well.  Hence 
y*  6  Am  or  Ym  C  A„. 
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THEOREM  5:  With  the  assumptions  mentioned  earlier  the  following  relationship  is  true: 

Max  -  g(v)  =  Maxt/(v,/G 

> 

Subject  to:  /j  ( v )  >  s. 

.v  €  5 

PROOF:  Follows  directly  from  Theorems  3  and  4. 

3.  PAIRED  COMPARISON  METHOD 

Using  the  results  of  Section  2  we  will  develop  a  procedure  needing  onl>  a  paired  compari¬ 
son  from  the  DM,  i.e.,  given  any  two  feasible  solutions  and  their  outcomes.  sa>  > ' 1 '  and 
the  DM  only  has  to  specify  whether 

(16)  y1'  >  >’UI  or  >i<21  >■  ym  or  both 

where  denotes  "preferred  to." 

Interaction  of  this  form  is  presumably  much  less  demanding  than  asking  the  DM  to 
specify  his  local  tradeoffs.  We  will  presently  indicate  how  (16)  can  be  used  to  eliminate  a  por¬ 
tion  of  the  efficient  set  and  progressively  converge  to  the  "best  compromise  solution.” 

Based  on  the  theorems  given  in  Section  2,  the  BCMP  problem  has  been  reduced  to  deter¬ 
mining  the  maximum  of  g(v)  where  v  belongs  to  the  interval  (v,.  v*l  where  v,  =  /j(.v*>.  How¬ 
ever,  jg(v)  is  not  known  explicitly  since  U  is  not  known.  But.  using  a  search  technique  which 
requires  only  functional  comparison  and  not  function  values,  we  can  still  solve  the  BCMP  prob¬ 
lem,  using  the  following  region  elimination  concept: 

For  a  unimodal  function  g(v)  defined  over  a  finite  interval  (v,,v‘).  let  v 4  and  vg  be  two 
points  in  the  interval  such  that  v4  <  vg.  Then,  g(v4)  <  g(vg)  implies  that  the  maximum  of 
£<v)  will  not  lie  in  the  interval  ( v/f  v ^ ) .  On  the  other  hand.  ,t» ( v 4 )  >  g(vs)  implies  that  the 
maximum  will  not  lie  in  the  interval  (vg,v*). 

General  Steps  of  the  Algorithm 

Step  1:  Solve  PI:  Max  ./j(.v),  subject  to  .v  €  S. 

Set  max  ,/j  (x)  =  v* 

Solve  P2:  Max  /2(x),  subject  to  .v  €  S. 

Set  max  /2(x)  =  w* 

Step  2:  Solve  Max  /j(x),  subject  to  x  €  Sand  /2(x)  ^  tv*. 

Set  the  maximum  of  f\  v,.  Now  the  optimal  value 
of  ,/j  to  the  BCMP  lies  between  v,  and  v*. 

Step  3:  Choose  two  values,  \A  and  vg,  such  that  v,  <  \  A  <  vg  <  v*.  Solve  the  problem 
P v:  Maximize  /2(x),  subject  to  x  €  S  and  ,/j(x)  ^  v  for  v  =  \A  and  v  =  vg;  let 
g(v)  —  Max  /2(x)  for  Pv. 
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Step  4:  Let  y<u  =  [v^.gfvy),  y(2)  =  [vs,g(vB)].  Given  y111  and  y'*’,  the  DM  is  asked  to 
specify  whether  y{|)  >•  (preferred  to)  y<2\  or  y12>  ^  ym,  or  indifferent.  Usine  the 
DM’s  response,  a  portion  of  the  efficient  set  can  be  eliminated. 

If  y »  >  y(2),  then  set  \u  -  vB;  go  to  step  5. 

If  y<2>  >  y(1),  then  set  v,  =  v4;  go  to  step  5. 

Step  5:  If  |v,  —  vj  <  «  (chosen  small  number)  stop;  otherwise  return  to  step  3. 

Figure  1  illustrates  the  paired  comparison  method.  Here,  y121  >  y"1  since  U(y>2')  > 
U (y111).  Hence,  the  maximum  of  U  cannot  lie  in  the  interval  [v/.v  j.  Eliminating  the  region 
(v,,v4),  the  interval  of  uncertainty  where  the  optimum  lies  reduces  to  (v^,v*l  If  Golden  Sec¬ 
tion  Section  Search  is  used  to  generate  the  two  new  points  between  v4  and  v*one  of  the  new 
points  will  turn  out  to  be  vfi.  With  Golden  Section  Search,  we  will  be  able  to  bracket  the  best 
compromise  solution  y°  to  less  than  10%  of  the  original  interval  on  v  with  just  5  paired  compar¬ 
isons  from  the  DM;  in  10  paired  comparisons,  the  optimal  solution  will  be  within  1%  of  the  ori¬ 
ginal  interval  of  uncertainty  [v(,v*].  Thus,  for  a  specified  level  of  uncertainty  we  have  a  finite, 
rapidly  convergent  procedure  using  only  paired  comparison  of  two  efficient  solutions. 
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Numerical  Example 

Consider  the  following  bicriteria  program  [71  with 

f\(x)  =  32  —40  x2  +  23  Xj  —  7  x4 

ftix)  =  32  —  10  X|  —  4  x2  +  7  X)  —  7  jr4 

and  the  feasible  region  5  is  specified  by  the  following  linear  equality  constraints  in  detached 
coefficient  form  in  addition  to  the  usual  nonnegativity  restrictions: 


•Xj 

*2 

*3 

*4 

*5 

*6 

i  b 

1 

0 

-23/50 

27/50 

0 

0 

0 

1 

-65/100 

35/100 

0 

0 

0 

0 

1/25 

1/25 

1 

0 

16/25 

0 

0 

15/50 

-35/50 

0 

1 

In  order  to  simulate  the  interaction  process,  we  will  assume  that  the  DM's  implicit  prefer¬ 
ence  function  (which  guides  him  in  the  selection  process)  is  the  following  quasiconcave  func¬ 
tion 

Uift.ft)  -  fln  ft. 

The  true  optimal  solution  is  .v*  =  (A|,x2,A3,jr4,X5,A6) 

-  (1.2857,  0,  0.7895,  3.1805,  0.4812,  2.7895)  with  ft  =  (./?,  ft) 

*  (27.895,  2.406)  and  U(ft.  ft)  -  22.13. 

Solution  Using  Paired  Comparison  Method 

Step  1:  Solving  (PI)  and  (P2)  we  get  v*  =  60  and  w *  «  3.2 
Hence  vu  =  60 

Step  2:  Solving  (@;.)  yields  v,  =  8 
Iteration  1 

Step  3:  Using  Golden  Section  ratios  0.618  and  0.382,  v4  =  27.864  and  v8  =  40.136.  Solve 
the  problem  /\  for  v  -  v4  and  vfl.  g(v^)  -  2.407  and  g(vfi)  =  1.707 

Step  4:  Interact  with  the  DM  and  seek  his  paired  comparison  between  y1*  -  (27.864, 
2.407)  and  yl2)  —  (40.136,  1.707).  Assuming  that  DM  is  guided  by  the  implicit 
preference  function  U(f\.f2)  -  f\/}  ft,  t/[y(,>l  —  22.12  and  f/Ly(2)l  —  20.01.  By 
the  property  of  the  preference  function,  /n  >  yQ)  if  t/[>,<,,l  >  (/ly21).  Hence, 
yu  would  be  preferred  to  y<2>  Consequently,  vu  will  be  updated  to  vs  —  40.136, 
thereby  eliminating  a  portion  of  the  efficient  frontier. 

Step  5:  Assuming  that  the  interval  of  uncertainty  is  to  be  reduced  to  10%,  the  search  has 
to  be  continued  further  on  the  reduced  efficient  frontier.  For  brevity  of  presenta¬ 
tion  the  results  of  further  calculations  are  summarized  in  Table  1. 
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TABLE  1  —  Solution  of  Bicriteria  Problem  Using 
Paired  Comparison  Method 


Iteration  # 

vt 

\u 

v< 

vs 

V" 

(21 

y 

1/ 1/"] 

C/T.t':,l 

Remaining 
Interval 
<%  of  Orig¬ 
inal) 

0 

8 

60 

— 

- 

— 

— 

- 

100 

1 

8 

40.136 

27.864 

40.136 

(27.864. 

2.4071 

<40.136. 

1.707) 

22.12 

20.01 

61.8 

2 

20.276 

40  136 

20.276 

27.864 

(20.276. 

2.841) 

(27.864, 

2.407) 

21.12 

22.12 

38.2 

3 

20.276 

32.549 

27.864 

32.549 

(27.864. 

2.407) 

<32.549. 

2.140) 

22.12 

21.82 

23.6 

4 

24.964 

32.549 

24.964 

27.864 

(24.964, 

2.573) 

<27.864, 

2.407) 

21.97 

22.12 

14.6 

5 

24.964 

29.652 

27  864 

29652 

(27.864, 

2.407) 

<29.652. 

2.306) 

22.12 

22.09 

9.0 

4.  COMPARATIVE  TRADEOFF  METHOD 

The  local  trade-off  at  a  feasible  point  x  (l  S,  between  criteria  2  and  1  is  defined  to  be 


evaluated  at  x. 

Tn  measures  the  loss  in  criterion  f2  from  the  current  level  the  DM  is  prepared  to  trade¬ 
off  for  unit  gain  in  criterion  fx. 

If  the  DM  can  provide  local  trade-offs  then  the  problem  can  be  solved  interactively  using 
Geoffrion-Dyer-Feinberg  [8]  type  approaches.  However,  it  has  been  repeatedly  observed  in 
practice  that  precise  tradeoffs  are  difficult  to  provide.  However,  the  DM  can  provide  "impre¬ 
cise”  tradeoffs  much  more  easily,  e.g..  "interval  estimates"  in  place  of  "point  estimates."  For  the 
bicriteria  case,  an  estimate  of  the  local  tradeoff  that  is  easier  than  the  interval  estimate  is  the 
"comparative  estimate."  The  DM  is  provided  with  a  number  and  will  be  asked  to  respond 
whether  or  not  he  would  be  prepared  to  tradeoff  more  than,  less  than,  or  equal  to  the  specified 
number  in  criterion  /2  for  a  unit  gain  in  criterion  f\.  Based  on  the  DM’s  response,  a  portion  of 
the  efficient  frontier  can  be  eliminated.  We  will  presently  indicate  how  the  results  of  Section  2 
can  be  used  to  provide  the  basis  for  the  "comparative  trade-off  method." 

At  any  feasible  point  x  E  S,  let  X2)  be  the  perturbation  in  /2  from  its  current  level  per 
unit  perturbation  in  /(.  Limiting  ourselves  only  to  efficient  solutions,  we  find  that  Theorem  2 
provides  us  a  method  of  generating  efficient  solutions;  also  Theorem  1  specifies  that  the  con¬ 
straint  f\(x)  >  v  in  program  P-  will  be  binding  in  every  optimal  solution.  Hence,  x2)  is 
obtainable  as  the  negative  of  the  Lagrange  multiplier  associated  with  the  constraint  f\(x)  >  v. 
Note  that  X2)  is  part  of  the  solution  output  of  most  mathematical  programming  algorithms 
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X2)  has  the  following  interpretation:  X2|  measures  the  loss  in  f2  from  its  current  level  that 
must  be  suffered  per  unit  gain  in  /j.  On  the  other  hand  T2\  measures  the  loss  in  />  the  DM  is 
prepared  to  trade-off  per  unit  gain  in  ,/j.  Naturally,  these  two  quantities  must  be  related  to  the 
optimality  of  BCMP  and  the  precise  relation  is  the  subject  of  our  next  theorem. 

THEOREM  6: 

Let  x *  be  the  optimal  solution  to  BCMP  with  fix r*)  =  (/*,  /*).  Let  x  be  any  efficient 
solution.  Then 


(a)  x *  is  optimal  to  BCMP  if  and  only  if  X2)  =  T2 1,  at  x* 

(b)  At  x  if  X2|  <  T2 1  then  ,/j  <  /*;  similarly,  if  X2|  >  T2I  then  ,/j  >  ,/f. 


PROOF:  Part  (a)  Sufficiency:  At  x *  let  X2|  =  r2).  By  definition  of  7"2(,  at  x *  the  DM  is 


indifferent  to  unit  gain  in  ,/j  and  A f2  units  of  loss  in  f2  where  A f2  = 
other  words 


bU 


a/t 


dU 


8/j 


In 


(18)  [/i(x*),/2(x*))  -  [/,(*•)  +  S|,/2(x*)  -  A J2  ■  8,] 

where  denotes  the  indifference  of  the  DM  and  8)  is  the  differential  change  in  ,/j.  When 

A2i  =  Tn,  in  the  neighborhood  of  x*,  a  differential  change  8,  in  ,/j  is  accompanied  by  a 
differential  change  of  -(A /2)  •  8,  in  /2,  as  per  the  definition  of  X2).  Hence,  points  in  the 
neighborhood  of  x*  are  equally  preferred  to  x*;  hence,  x*  is  the  local  optimum.  By  quasicon¬ 
cavity  of  U.  x*  must  be  the  global  optimum  too. 


Necessity:  Let  x*  be  optimal  to  BCMP  but  X21  ^  Tji  say  X2|  <  T2\.  Around  a  neighbor¬ 
hood  of  x*,  3  a  point  l./j(x*)  +  8,,  /2(x*)  —  8tA2|]  where  8,  is  a  differential  change  in  ,/j  as 
per  the  definition  of  X2).  At  X2t  <  7"^, 

(19)  (/2(x*)  -  8|X2t)  >  (/2(x*)  -  8,r2!)  TorS,  >  0. 

Since  U  is  strictly  increasing  in  both  /j  and  /2,  inequalities  (18)  and  (19)  taken  together  imply 
that 

(20)  l/,(x*)  +  8,,/2(x*)  -  8,X2i]  >  |/,(x*)  +  8|,/2(x*)  -  8|  7~2 , ] 

—  [/|(x*)./2(x‘)l. 

By  transitivity  of  preferences 

(21)  [ /|(x*)  +  8,.  /2(x*)  -  8|X2|]  >-  l/,(x*)./2(x*)]. 

Obviously,  (21)  contradicts  the  optimality  of  x*.  Hence,  X2|  <  T2[.  Similarly  we  can  show  that 
x2|  >  f21.  Hence  the  proof. 


Part  (b)  Let  X2|  <  T2\.  By  (21)  an  increase  in  /t  from  its  current  value  leads  to  a  pre¬ 
ferred  solution.  Hence,  an  increase  in  ,/j  is  a  locally  increasing  direction  for  U  along  the 
efficient  frontier,  Also  x  being  an  efficient  solution  is  a  solution  to  program  (Pv)  by  Theorem  2 
and  /|(x)  -  v  by  Theorem  1.  We  noted  earlier  that  x  is  a  solution  to  program  (P3)  also,  i.e.. 

Max  U(\,f2).  Theorem  5  established  the  unimodality  of  #(v)  which  is  the  "value"  of 

iv.r2if  a 
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optimal  solution  to  program  (P3);  as  /j(x)  -  v,  the  value  of  optimal  solution  to  program  (P3) 
is  unimodal  in  /j(x).  By  the  property  of  unimodality,  U  increases  along  a  locally  improving 
direction  continuously  up  to  the  optimal  solution  and  continuously  decreases  in  the  other  direc¬ 
tion.  Hence, /|(x*)  >  ,/j(x)or/f>  /,.  Similarly,  we  can  prove  the  other  case  also. 

Note  that  Theorem  6  provides  a  convenient  way  of  ruling  out  a  portion  of  the  efficient 
frontier.  All  that  is  needed  is  an  interaction  with  the  DM  whether  or  not  A2i  -  F2!.  Depending 

> 

on  his  response  we  can  narrow  down  the  search  of  efficient  points  to  only  those  with  /,  value 
greater  than  or  less  than  the  current  /,  value.  We  wish  to  point  out  here  that  the  Surrogate 
Worth  Tradeoff  method  due  to  Haimes,  Hall  and  Freedman  [9]  is  essentially  based  on  a  result 
similar  to  part  (a)  of  Theorem  6.  However,  they  do  not  have  a  result  comparable  to  part  (b)  of 
the  Theorem.  Thus  no  "region  elimination"  is  possible  in  their  method.  Also  the  DM  has  to 
specify  how  far  T2\  is  greater  than  or  less  than  X2)  on  a  subjective  scale  of  + 10  to  -10. 


We  now  formally  state  the  procedure  as  follows: 

Step  1:  Solve  (PI)  and  (P2)  and  determine  v*  and  w*. 

Step  2:  Solve  (£?*•)  and  determine  f\(x*).  Set  v,  =  /j(x2). 

Step  3:  Solve  Pv  for  v  =  v^  where  v,  <  v^  <  v„.  Determine  Aji,  i.e.,  the  Lagrange  mul¬ 
tiplier  corresponding  to  the  constraint  f\(x)  ^  v^  in  the  program  Pv. 

Step  4:  Interact  with  the  DM  and  present  him  with  X2i  and  seek  his  comparative  trade-off 
T-i  1,  i.e.,  whether  f2 1  ^  *21-  Depending  on  the  outcome  of  interaction  eliminate  a 

portion  of  the  efficient  frontier  using  the  following  rule: 

If  X2(  <  T2 1,  update  v,  =  v,,;  go  to  step  5. 

If  X2|  >  r2) ,  update  vu  =  v^;  go  to  step  5. 

If  \2|  -  f2„  stop;  the  current  solution  to  Pv  is  optimal  to  BCMP. 

Step  5:  If  |v,  -  vj  <  {  (chosen  small  number)  stop;  else  go  to  step  3. 

Note  that  selection  of  v^  values  can  be  done  efficiently  using  the  midpoint  of  the  interval. 
For  example,  with  5  stages  of  interaction,  we  will  be  able  to  bracket  the  optimal  solution  to  less 
than  4%  of  the  original  interval.  In  7  stages,  we  will  be  within  1%,  etc.  Thus  we  have  a  finite, 
rapidly  converging  procedure  using  only  "comparative  tradeoffs."  We  shall  illustrate  the  method 
using  the  same  example  given  in  Section  3. 

Numerical  Example 

Solution  Using  Comparative  Trade-Off  Method: 

Step  1:  As  before  v*  -  60;  w*  -  3.2.  Hence  v„  *•  60. 

Step  2:  As  before  v,  -  8. 
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Iteration  1 

Step  3:  Using  the  'Bisection'  method  (i.e.,  midpoint  values)  to  choose  vA  values,  v,,  =  34. 
xA  which  solves  Pv  for  v  =*  vA  has  /  values  of  (34,2.057);  also  the  Lagrange  multi¬ 
plier  X  j  i  =  0.0571. 

Step  4.  Interact  with  the  DM  and  seek  his  comparative  estimate  of  local  trade-off  r2),  i.e., 
whether  T2I  >  A2),  T2l  <  A2i  or  T2l  -  A2|.  Assuming  that  the  DM  is  guided  by 
the  implicit  preference  function 


U(fuf2)  =  /,2/3  f2\  we  get 


6  U 

a/. 

bU 

a/, 


2/3  /{ 1/3  f2  and  ~r  =  Evaluating  at  xA , 
0/2 

j  (34)_  1/3  (2.057)  =  0.4233  and 


bU 

a./: 


342/3  =  10.4951.  Hence,  T2l 


6  U  I  8  U 

a/,  /  3/2 


0.4233 

10.4951 


-  =  0.0403. 


Thus,  A2i  =*  0.0571  >  r2 1  =  0.0403.  Hence,  update  \u  to  vA  =  34 

Step  5:  As  the  limits  and  vu  are  not  close  enough,  the  search  has  to  be  continued 
further  on  the  reduced  efficient  frontier.  The  details  of  the  calculation  for  further 
stages  are  summarized  in  Table  2. 


TABLE  2  —  Solution  of  Bicriteria  Problem  Using 
Comparative  Tradeoff  Method 


Iteration  # 

— 

“I 

— 

L- 

— 

V4 

T 

> 

ac 

T 

> 

*2! 

^21 

Remain  Interval  J 

(%  of  original  interval)  | 

1  0 

i 8 

60 

- 

— 

— 

100 

1  1 

8 

34 

34 

(34,2057) 

0.0571 

00403 

50 

2 

21 

34 

21 

(21,2.800) 

0.0571 

00889 

25 

1  3 

27.5 

34 

27.5 

(27  5,2.429) 

0.0571 

0.0589 

12.5 

1  4 

27.5 

30.75 

30.75 

(30  74,2.243) 

0.0571 

0.0486 

6  25  | 

5.  BICRITERIA  OPTIMIZATION  APPLIED  TO  THE  U.S.  AIR  FORCE 
CARDIOVASCULAR  DISEASE  CONTROL  PROGRAM 

In  this  section  we  will  demonstrate  an  application  of  the  paired  comparison  method  to  a 
bicriteria  optimization  problem  of  designing  a  cardiovascular  disease  control  program  for  the 
U.S.  Air  Force  personnel,  this  part  of  a  study  currently  undertaken  by  Purdue's  School  of 
Industrial  Engineering,  under  contract  with  the  USAF  School  of  Aerospace  Medicine. 

The  United  States  Air  Force  (USAF)  is  planning  a  comprehensive  program  for  reducing 
the  incidence  of  cardiovascular  disease  (CVD)  among  its  active  duty  personnel.  This  program, 
known  as  the  Health  Evaluation  and  Risk  Tabulation  (HEART)  program,  would  consist  of 
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(i)  an  extensive  educational  program  about  CVD,  its  causes  and  prevention, 

(ii)  a  risk  screening  program  to  identify  those  AF  personnel  who  have  a  high  susceptibil¬ 
ity  for  developing  CVD,  and 

(iii)  a  risk  intervention  program  to  treat  such  high  risk  personnel  and  reduce  their  CVD 
risk  by  medications  and/or  behavior  modifications. 

In  order  for  such  a  program  to  be  viable,  the  expected  benefits  resulting  from  reduced 
incidence  of  heart  disease  must  outweigh  the  cost  of  the  HEART  program. 

Risk  Identification:  A  general  cardiovascular  risk  profile,  based  on  the  Framingham  Study 
[11],  is  presently  being  considered  as  the  means  of  identifying  high-risk  personnel  for  treat¬ 
ment.  With  this  approach  a  logistic  regression  model  would  be  applied  to  each  individual  and 
would  yield  his  CVD  risk;  namely,  the  probability  of  manifestation  of  cardiovascular  disease  in 
the  individual  in  an  eight-year  time  period. 


Risk  Intervention:  By  rank  ordering  the  CVD  risk  scores  of  USAF  personnel,  a  certain 
percentage  of  the  highest  risk  individuals  are  selected  for  specialized  face  to  face  therapy  for 
modifying  their  elevated  values  of  risk  factors.  At  present,  those  high-risk  individuals  with 
elevated  systolic  blood  pressures  will  be  treated  with  medication,  while  those  with  elevated 
cholesterol  levels  will  be  treated  with  diet  modification  only.  Both  individual  and  group  thera¬ 
pies  will  be  used  to  persuade  smokers  to  quit  smoking. 


The  recurring  cost  of  the  HEART  program,  excluding  the  initial  start-up  costs,  is 
estimated  between  7  and  9  million  dollars  a  year  depending  on  the  total  size  of  the  therapy 
group.  Hence,  before  implementing  the  HEART  program  on  all  airforce  bases,  the  Air  Force 
was  interested  in  answers  to  the  following  question: 


Given  the  total  size  of  the  therapy  group,  determine  the  optimal 
number  of  Air  Force  personnel  to  be  selected  in  each  age  group 
of  flyers  and  nonflyers  such  that  the  total  "cost-effectiveness"  of 
the  HEART  program  is  maximized. 


The  therapy  selection  problem  has  indeed  two  competing  objectives.  One  objective  is  to 
minimize  the  USAF  CVD  risk  and  the  other  objective  is  to  minimize  the  HEART  program 
cost.  By  increasing  the  size  of  the  therapy  program,  the  CVD  incidence  in  the  Air  Force  will  be 
progressively  reduced  but  this  can  be  achieved  only  at  the  cost  of  increased  HEART  program 
expenditure.  If  we  follow  a  policy  of  selecting  the  high  risk  personnel  from  each  age  group  for 
risk  intervention,  the  graph  of  CVD  incidence  versus  therapy  size  will  be  a  monotonically 
decreasing  function.  Hence,  there  is  an  optimal  expenditure  level  for  the  HEART  program,  an 
optimal  level  of  CVD  incidence,  an  optimal  therapy  size  and  an  optimal  selection  strategy  for 
that  therapy  size.  Determination  of  all  these  quantities  is  a  nontrivial  problem  for  which  we 
developed  the  following  bicriteria  mathematical  programming  model  which  was  solved  using  the 
paired  comparison  method  developed  in  Section  3. 
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BCMP  Model 

Minimize 

Minimize 

Subject  to 

Where: 


f\(x) 

Mx) 

17 

I  <  s 

7-1 

a,  <  Xj  <  b, 

7-1 . 17. 


x  =  selection  strategy  (x|,jr2 . x17)  for  each  age  group  of  flyers  and  nonflyers. 

f\(x)  —USAF  CVD  risk  for  a  given  selection  strategy  x 
y2(jc)  —HEART  program  cost  for  a  given  selection  strategy  x 
S  —  Maximum  size  of  the  therapy  group  (fixed) 

a i  —  Minimum  number  to  be  selected  from  age  group  j  (flyers  and  nonflyers); 

7=1 . 8  correspond  to  flyers  in  age  groups  20-24. ...,55-59,  respecitvely; 

7  =11 . 17  correspond  to  nonflyers  in  age  groups  15-19 . 55-59,  respectively. 

b,  =  Maximum  number  that  can  be  selected  from  age  group  j. 

Even  though  /2(jc)  could  be  determined  analytically,  ft(x)  is  indeed  a  random  variable 
because  of  the  unpredictability  of  the  effectiveness  of  the  risk  intervention  programs  on  modi¬ 
fying  the  risk  factor  levels.  Hence,  a  comprehensive  simulation  model  known  as  P-HEART 
(Purdue  Health  Evaluation  and  Risk  Tabulation)  was  developed  by  the  authors  [14].  P- 
HEART  simulates  the  USAF  population  and  performs  risk  identification,  therapy  selection  and 
risk  intervention.  The  description  of  the  simulation  model,  its  validation  and  findings  are  given 
in  Reference  [14].  Through  extensive  simulation  runs,  the  expected  value  of  ,/j(x)  for  different 
selection  strategies  were  estimated  and  a  polynomial  equation  was  fitted  to  get  analytical  forms 
for  f\(x)  by  age  group  and  flying  status. 

The  paired  comparison  method  was  then  programmed  within  the  frame  work  of  interac¬ 
tive  solution  of  BCMP  for  minimizing  CVD  risk  and  HEART  program  cost.  The  Generalized 
Reduced  Gradient  Algorithm  was  used  to  solve  the  nonlinear  programs  in  Steps  1,  2,  and  3  of 
the  interactive  algorithm.  The  interactive  program  was  extensively  tested  at  Purdue  using  Air 
Force  Officers  as  decision  makers.  The  program  has  been  well  received  by  the  policy  makers  in 
the  USAF  HEART  program  office  and  is  being  used  as  part  of  the  long  term  project  planning 
currently  underway  in  the  Air  Force.  For  more  details  on  the  interactive  computer  program 
and  its  output,  the  reader  is  referred  to  Sadagopan  [16]. 

6.  CONCLUSIONS 


In  this  paper  we  have  developed  interactive  procedures  for  the  solution  of  bicriteria  pro¬ 
grams.  Even  though  the  entire  set  of  efficient  solutions  can  be  generated  and  pictured,  such 
iteractive  procedures  do  have  merit  due  to  the  following  reasons:  in  many  real  life  problems 
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complete  generation  of  all  efficient  solutions  may  be  computationally  expensive;  even  granting 
such  generation,  the  DM  is  unaided  in  the  selection  process.  Hence,  the  procedures  developed 
in  this  paper  may  be  thought  of  as  "decision  aids"  (15], 

The  primary  power  behind  our  procedures  is  the  unimodality  property  (Theorem  3).  A 
similar  result  is  available  in  GeofTrion  [7],  However,  our  proof  of  unimodality  is  entirely  new 
and  allows  extension  to  more  general  cases  unlike  Geoffrion's  result.  Also  our  relationship 
between  Lagrange  Multipliers  and  local  tradeoffs  is  new.  For  more  details  on  the  bicriteria 
results,  U.S.  Air  Force  application  and  extensions  to  the  general  multi-criteria  problem,  the 
reader  is  referred  to  Sadagopan  [16]. 
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ABSTRACT 

There  are  a  great  number  of  queueing  sxstcmv  including  t he  A/'  A/’/c.  ihe 
G/'/A//c  and  the  discrete  Chin  I  queue  in  which  Ih  -  ie  probabilities  are 
determined  h>  repeated  queue  equations  I  his  pape"  -  a  simple,  efficient 
and  numericalb  stable  algorithm  to  calculate  the  sta  ;  a\  .lies  and  measure 
of  performance  for  such  s> stems.  The  method  avoi  ti.  complex  arith- 
metric  and  matrix  manipulations 


1.  INTRODUCTION 

This  paper  gives  a  new  method  to  find  the  equilibrium  probabilities  of  continuous-time 
Markov  processes  in  which  the  rale  of  going  from  state  tit  to  it  is  equal  to  d, for  any  n 
exceeding  a  certain  limit,  say  r.  The  largest  and  smallest  possible  jumps  are  It  and  — ,c  respec¬ 
tively.  that  is,  the  subscript  of  dk  runs  from  -y  to  It.  These  concepts  will  be  further  expanded 
in  the  next  section.  Here,  we  demonstrate  it  shortly,  using  the  M V M V I  queue  as  an  example. 
In  this  queue,  arrivals  of  size  /,  1  <  ./  ^  li  occur  at  rate  K,.  and  departures  of  groups  of  size  i 
occur  at  a  rate  of  yj.,.  1  ^  /  C  ft-  Hence. 

d,  =  K , .  I  <  ./  <  ii 

d-,  =  fji.,.  i  <  n.  1  <  i  <  y. 

The  paper  also  considers  discrete  Markov  processes  with  a  similar  structure.  In  this  case, 
there  is  a  probability  of  pk  to  go  from  state  n  to  n  +  k.  Queues  with  such  structure  include  the 
system  size  of  the  67'/,V//c  queue  (4|  and  the  waiting  lime  of  the  discrete  G7/C/1  queue  1 5 j 

At  present,  there  are  several  approaches  available  to  solve  such  queueing  problems.  In 
the  first  approach,  one  sets  up  generating  functions,  and  uses  partial  fraction  expansions  to 
invert  these  generating  functions.  These  expansions  require  one  to  find  the  roots  of  an  equa¬ 
tion  of  degree  y  +  h.  The  same  is  true  if  one  uses  operators  on  the  queueing  equations  and 
determines  the  characteristic  roots.  I  or  practical  applications,  these  approaches  have  some 
disadvantages.  The  equations  are  often  difficult  to  solve,  especially  when  they  are  of  a  high 
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degree.  Their  roots  are  usually  complex  and  difficult  to  work  with.  Moreover,  there  may  be 
double  roots  which  lead  to  further  complications.  Neuis  (3 !  therefore  suggested  solving  these 
problems  using  matrix  equations. 

The  method  in  this  paper  combines  the  approach  of  N'euts  wiih  the  classical  method.  In  a 
sense,  it  can  be  considered  as  a  modification  of  the  method  of  Neuts.  The  main  difference 
between  our  method  and  the  Method  of  Neuts  is  the  following.  In  Neuts-  approach,  a  matrix  R 
of  dimension  h  x  h  has  to  be  calculated.  In  our  approach,  only  the  first  column  of  R  is  calcu¬ 
lated.  which  reduces  the  number  of  operations  approximately  by  a  factor  of  h.  Even  though  the 
method  is  similar  to  the  one  of  Neuts,  its  derivation  is  done,  using  the  classical  approach.  In 
this  way,  our  method  bridges  the  gap  between  the  classical  approach  and  the  approach  of  Neuts. 


2.  PROBLEM  DEFINITION 


In  this  section,  a  precise  problem  definition  is  given  and  demonstrated,  using  a  number  of 
examples,  first,  we  discuss  the  discrete  case.  Let  there  be  a  Markov  chain  with  the  transition 
probabilities.  n.tn  ^  0  The  equilibrium  probabilities  of  this  chain  are  denoted  by  tt„. 
that  is,  n „  is  probability  to  be  in  state  n  As  is  well  known,  the  n„  are  determined  by  the  fol¬ 
lowing  equilibrium  equations,  provided  the  system  has  an  equilibrium 

(I)  TT„  -  £  Tr„,  n  >  0. 

Di-it 

We  assume  that  there  is  a  certain  limit  /-such  that  for  n  >  r.  only  depends  on  the 

difference  between  m  and  n.  In  other  words,  if  Q  represents  the  state.  dk  is  the  probability  to 
increase  Q  by  k  from  n  to  n  +  k.  If  tr„,  is  defined  to  be  zero  for  m  ^  0.  equation  1 1 )  becomes 
therefore  for  n  ^  r 


£  vmdn.m“  £  "V*  A,  n  >  r. 

We  furthermore  assume  that  dk  «  0  for  k  <  ~#or  k  >  h.  Thus,  one  has 

h 

n"  ~  H  w n-k  dk.  n  >  r. 

k—K 

If  one  solves  this  equation  for  7r„,  one  gets 


=  £  *n~k  ek-  "  >  r- 


(2) 

The  ek  in  this  expression  are  defined  as 
(3) 


ek  =  dk/(  \  —  d0),  k  ^  0 
e„  =  0. 

for  n  <  r,  one  uses  equation  (1),  that  is 


14) 


£  *mPm.n •  P  <  r. 

m—0 


Equation  (2)  will  be  referred  to  as  the  queueing  equation ,  whereas  equation  (4)  gives  the  initial 
conditions.  There  are  r  initial  conditions.  However,  it  can  be  shown  that  one  of  them  is  redun¬ 
dant.  In  its  place,  one  has  the  normalizing  condition 
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n-0 

If  there  exists  a  unique  equilibrium  vector  (77J.  this  vector  is  fully  determined  by  the  queueing 
equaiion.  the  r  —  1  initial  condition  and  the  normalizing  condition.  01  course,  if  r  =  1.  there 
are  no  initial  condition^.  As  is  well  known,  an  equilibrium  solution  will  exist  il 

/*  =  £  /  2L  1  ll  '  <  1  ■ 

i=i  /=  i 

This  paper  will  show  how  to  find  this  equilibrium  solution  in  an  efficient  way. 

In  continuous-time  Markov  chains,  the  equilibrium  probabilities  are  ueicrmined  by 
<6)  0  =  £  n  Z  0. 

Here.  qm„.  m  ^  n.  are  the  transition  rates,  and 

Win  ni  “  ~  Yg  tii,  n  • 

>i  *■  m 

We  now  consider  problems  where 

q„,„  =  n  ^  r. 

Here.  </„  is  defined  as 

d»  =  -  L  4  • 

As  in  the  discrete  case,  it  is  assumed  that  dk  =  0  outside  the  range  — g  <  t  <  h.  If  n„  =  0  for 
n  <  0,  one  finds  thus  from  equation  (6) 

h 

0  =  52  nn-kdkt  n  >  r. 
k—K 

This  equation  can  again  be  solved  for  rr„,  which  gives 

h 

(7)  TT„  =  £  n„^k  ek.  n  ^  r. 

Here, 

e„  =  0 

<8)  «’*  =  dkn-d„).  k*  o. 

Note  that  equation  (2)  and  equation  (7)  have  exactly  the  same  structure.  In  particular.  i\  >  0 
and  the  sum  of  the  ek  is  one  in  either  case.  The  normalizing  condition  is  given  by  (51  both  in 
the  discrete  and  the  continuous  case.  The  initial  conditions  of  the  continuous  case  are  easily 
found  from  (6)  by  letting  0  <  n  <  r. 

We  now  give  3  examples  which  will  be  used  later  for  the  purpose  of  demonstration. 

EXAMPLE  I.  First,  consider  the  waiting  time  in  queue  of  a  discrete  GI/GI 1  queue.  Fol¬ 
lowing  Ponstein  (51,  a  Markov  chain  is  set  up,  in  which  the  state  space  are  the  waiting  times 
rather  than  the  number  of  elements  in  the  system.  To  set  up  this  chain,  it  is  assumed  that  the 
difference  between  the  service  time  (S)  and  the  mtcrarrival  time  {A )  is  (a)  always  integer  and 
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t b >  always  between  —  .e  and  h.  It  is  well  known  (see  e.g.  Ponstein  15]).  that  the  waiting  time  of 
customer  /  exceed  the  one  of  customer  /  —  I  by  an  amount  .S'  —  .4.  provided,  of  course,  that 
customer  /  -  I  had  a  waiting  time  above  —  (S  -  4  I.  Consequently,  if  H  represents  the  wait¬ 
ing  time  of  a  customer  in  equilibrium.  H  increases  by  A  with  probability  P(S  —  A  —  A).  This 
means 

dk  =  m  -  A  =  A),  -g  <  A  ^  /; 
and 

i\  =  PIS  -  A  =  A  )/[l  -  m  =  A  )|.  A  *  Ol 


In  the  problem  considered,  there  is  only  one  initial  condition,  and  this  condition  can  be 
dropped  as  redundant.  The  queueing  equation,  together  with  normalizing  condition  is  thus 
sufficient  to  determine  all  tt„.  Here,  7r„  represents  the  probability  of  having  a  waiting  time  of 
n. 


EXAMPLE  2:  In  the  67"/.V//l  queue,  arrivals  occur  in  groups  of  size  h.  Let  a,  be  the 
probability  of  servicing  /  elements  between  two  consecutive  arrivals  and  let  n„  the  probability 
of  having  n  elements  immediately  before  an  arrival.  It  is  easily  verified  that  the  queueing  equa¬ 
tion  is  given  by  (2)  with  r  =  I  and 


(10) 


i\  =  a„..k/(  1  -  «,,).  <  A  <  /;.  A  0 

<'<>  =  0.  j 


Here.  is  the  probability  of  serving  A  customers  between  two  successive  arrivals.  The  initial 
conditions  consist  only  of  one  equation  which  can  be  omitted.  Also,  in  the  case  considered,  g 
is  infinite.  However,  for  practical  calculations,  it  is  sufficient  to  carry  only  a  finite  number  of 
<V 


EXAMPLE  3:  Consider  the  Mx/My/\  queue.  A,,  I  <  ./  <  h.  is  the  rate  at  which  groups 
of  size  ./ arrive.  No  service  is  done  unless  at  least  g  elements  are  in  the  system.  If  the  number 
in  the  system  is  g  or  more,  there  is  a  rate  of  m,-  I  ^  ^  g.  that  /  elements  leave  the  system 

after  having  received  service  together.  For  this  problem,  one  has  the  following  transition  rates 

<fn:»  =  o  <  II  -  m  ^  h 

Qm.n  “  P-  m-n'  0  <m-n^g,  g. 


All  other  qm  n.  m  ^  n  are  zero,  in  particular,  all  m  <  g ,  n  <  m  are  zero  because  nobody 
leaves  if  there  are  less  than  g  elements  in  the  system.  The  qm  m  become 


dm.m  =  ~  L  K'  •  m  <  q 

/-I 


L x-  + 

/-i  ,-i 


m  >  g. 
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A  lutle  reflection  shows  that  r  =  ,e.  and  the  queueing  equation  is  given  by  (5)  with 

On  =  0 

h  s 

Z x*  +  5>, 

i-i  i-i 

o*  =  m*/  £  +  X  **< 

/-i  i-i 

The  initial  conditions  are  given  by  (7),  that  is 

oo  h 

0  =  Z  "kQk.o  =  _7r»  Z  X'  + 

A— 0  /-l 

oo  /;  /?  £ 

o  =  Z  =  Z  7T"-/X/  ~  Z  K>  +  Z  «  <  & 

k =0  /-I  /-I  /  -  v  - n 

Again,  the  convention  was  used  that  7r„,  =  0  if  m  <  0.  From  the  initial  conditions,  one  can  be 
omitted,  and  it  turns  out  to  be  convenient  to  omit  the  one  corresponding  to  n  =  g  —  1. 

Above,  we  discussed  a  few  examples  which  are  amenable  to  our  method.  Additional 
examples  can  easily  be  generated.  We  now  show  how  to  solve  these  examples. 

3.  THE  MAIN  RESULT 

The  fundamental  result  of  this  paper  is  the  following.  If  p  <  I.  and  if  the  Markov  chain 
is  ergodic.  there  are  h  values  o,  >  0,  1  <  j  ^  h  such  that 

h 

(I2>  =  Z  °/7r"-/-  »  >  r. 

,/«i 

The  ar  1  <  j  ^  h  are  determined  in  a  unique  way  by  the  ek ,  —r  ^  A  <  h.  Further¬ 
more, 

h 

z  «/  <  '• 

J- 1 

This  paper  will  present  several  methods  to  find  the  Once  the  o,  are  found,  the  problem  is 
actually  solved.  Together  with  the  r  —  1  initial  conditions  and  the  normalizing  equation  (12) 

gives  exactly  r  +  l  equation  10  determine  nn,  nt . 7rr.  Moreover,  the  generating  function 

of  the  7T„  can  be  expressed  in  terms  of  a,  and  tra,  tt\ . rr,_t.  To  see  this  it  is  convenient 

to  define  an  as  - 1 ,  and  set 

h 

A  (z)  = 

/-n 

n-fl 

In  this  case,  one  finds,  as  will  be  shown  in  Section  4: 

(13)  A(z)P<2)=  £<j, 

/“  0  IT-/ 


VOL  29,  NO  3,  SEPTEMBER  1982 


NAVAL  RESEARCH  LOGISTICS  Ql  ARTERLT 


466 


\\  K  C.K ASSM  \NN  \\l)  M  I  (TIUi)IIRS 


When  setting  z  =  1.  P(z)  becomes  I,  and 


This  means 
(14) 


A  <1>  =  £  a,  £  =  £  TTn  £  a,.  e  =  mini/-  -  n  -  1.  /;). 

/— 0  «=  /  r/«0  /»0 


h  r- 1  e 

£  a,  =  - 1  +  a,  +  a_,  +  ...  +  dj  =  £  7r„  X0'1  e  =  —  '  •  h) . 

/*■!)  /j  — 0  0 


This  equation  provides  a  convenient  normalizing  condition  and  can  be  used  in  place  of  (5).  To 
find  L.  the  expected  number  in  the  system,  one  can  differentiate  (13)  and  find 

A'(:)P(:)  +  ,4<z)F(z>  -  £  a,  £  nn 

/“0 

If  z  =  1,  /* ( r )  =  1  and  /*'( r )  =  L  Using  these  values,  the  above  equation  gives  after  some 
calculation 


<15)  Z.  =  £  a,  ./I  /  £  a, 

/- o  [»)-,  J  /-o 

Equations  (14)  and  (15)  simplify  if  r  =  I.  Then 

A  (z)  P(z)  =  CloTT o  =  -7T0. 

If  one  sets  z  =  1,  this  gives, 

(16)  *0-  -/4(l)  =  1  -  T  a,. 


7T0  =  -A  (!)  =  1  -  £  a,. 

/-! 


This  means  that 

I  a,  <  1. 

If  Q  is  a  random  variable  representing  the  state  of  the  system.  E(Q )  can  be  found  as: 

(17)  £(£>)  =  £  nn„  =  -,4'(1)/.4<1)  -  £  ./a,/*,,. 

n— o  /To 


The  ideas  just  presented  shall  now  be  demonstrated,  using  the  examples  discussed  above. 
First,  consider  the  discrete  G//G/ 1  queue.  Specifically,  suppose  that  the  distribution  of  interar¬ 
rival  time  and  the  distribution  of  the  service  time  are  as  given  in  Table  1. 

TABLE  1  —  Distribution  of  the  Interarrival  Time 
and  the  Service  Time. 


k 

1 

2 

3 

P(S  =  k) 

0.4 

0.3 

0.3 

P(A  =  k) 

0.1 

0.3 

0.6 

Using  these  values,  one  can  find  the  ek,  -g  <  k  <  h,  from  equation  (9),  and  the  eK  in  turn, 
determine  the  a,,  1  ^  j  <  h  (see  Sections  4  and  5).  For  the  problem  described  by  Table  1. 
one  finds, 

a,  =  0.23156 
a2  =  0.05039. 
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Equations  (16)  and  (17)  give  now 

nu  -  P(  W  =  0)  =  1  —  a,  -  a2  =  0.71804 

E(W )  =  (o,  +  2  a2)/P(  IT  =  0)  =  0.46287. 

Since  7r_  (  is  assumed  to  be  zero,  one  can  use  (12)  to  calculate  tti,  ir2 ,  77-,  and  so  on. 
77,  =  7T()<7 1  +  7T_|fli  =  0.1663 
77  i  =  TT  i  ZZ  i  +  TTnfli  =  0.0747 


Since  all  a,  >  0,  this  recursion  is  numerically  very  stable. 

Next,  consider  the  Glh/M/\  queue.  Specifically,  let  the  arrival  time  distribution  be  deter¬ 
ministic,  let  h  =  2,  k2  ~  0.4  and  ju  =  1.  The  ak  of  equation  (10)  are  in  this  case  Poisson- 
distributed  with  parameter  2.5.  One  finds  in  this  case 

a,  =  0.41045 

a2  =  0.13711. 

Again,  all  entities  of  interest  can  be  calculated.  L,  the  expected  number  in  the  system  proceed¬ 
ing  an  arrival  becomes  for  instance 

L  =  (a,  +  2a2)/(l  -  a,  -  a2 )  =  1.5132. 


Finally,  consider  the  Mx/ My/\  queue  of  Example  3.  Specifically  let 

=  \2  =  Ml  =  M’  =  M.t  =  *• 

In  this  case,  the  a,  turn  out  to  be 
a,  -  0.34960 
a2  =  0.24582. 

The  initial  conditions  are  now 
0  =  —2tto  +  77, 

0  =  7T0  —  277,  +  77,  +  i t4, 

7 T}  and  7r 4  are  given  by  (12)  as 

Tty  =  ai7T2  +  a2  7r ,  =  0.34960tt2  +  0.2458277| 

7T  4  =  £7|7r3  +  a2  77  2  =  (a,2  +  02>772  +  a,a277,  —  0.36804772  +  0.0859477(. 
Using  these  values,  the  initial  conditions  become 

0  -  2t70  +  0.2458277,  +  0.34960t72 

0  -  770  -  1 .6682477,  4-  0.71  764t72. 

As  a  third  equation,  one  uses  the  normalizing  condition  as  given  by  (14). 

—  1  +  a,  +  a2  —  i7n(— 1  +  a,  +  a2)  +  tt,  (—1  +  a,)  —  n2 
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or 

0.40458  =  0.40458tt()  +  0.6504077,  +  ir2. 

These  3  equations  ean  be  solved  for  rr(|.  77  ,  and  n2. 

The  result  is 

ir„=  0.06741.  77,  =  0.15840,  n2  =  0.27428. 

It  is  now  simple  to  find  L  from  Equation  (15).  One  has 

L  —  [-<77,  +  2t7:)  +  a |  (770  T-  2 77 ,  —  1 )  +  a2  (277,,  —  2 ) ]/ ( —  I  +  a |  +  a2) 

=  3.4128. 


Thus,  onee  the  a ,  are  found,  the  problem  is  solved  easily.  The  next  two  sections  show 
several  methods  to  find  the  a,. 


4.  THE  SIGNIFICANCE  OF  a,  AND  THEIR  CALCULATION 


In  this  section,  it  is  shown  that  equation  (12)  holds,  and  how  one  can  find  the  a,  in  the 
general  case.  From  the  initial  conditions  and  from  the  queueing  equation,  one  finds  the  follow¬ 
ing  expression: 

Piz)  -  Ui z)/Viz). 


Here  Uiz)  is  a  polynomial  of  degree  r  +  x'  —  1  which  is  of  no  further  interest,  and  l'(r)  is 
equal  to 


K  +  h 

F(e)=  X 

*-n 


-k  _  -K 


It  is  well  known  (see  e.g.  15])  that  F(r)  has  a  zero  at  :  —  1.  j?  —  1  zeros  inside  the  unit  circle 
and  h  zeros  outside.  It  follows  that  K(z)  can  be  written  as 


Viz)  =  A  (:)  Biz). 


Here 


/f(r)  =  —  l+a,z  +  a2z2  +  . . .  +  ahzh 


contains  all  the  li  zeros  outside  the  unit  circle. 


Because  of  (he  convergence  of  P(z),  Uiz)  must  be  divisible  by  Biz).  Since  U(z)  has  a 
degree  of  r  +  k  -  I,  and  Biz)  has  a  degree  of  C(r>  has  a  degree  of  r  -  1.  Consequently. 

r~  I 

Uiz)/ Biz)  =  Ciz)  =  £  c,z‘. 

i-0 


or 

Piz)  =  Ciz)/ A  (z). 
This  equation  can  be  written  as 
(18)  Piz) A  (z)  —  Ciz). 
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The  coefficient  of  of  the  product  P(r).4  <r)  equals,  as  is  well  known 

£  a,‘n„-r 

for  n  ^  r  -  1.  this  means 

i-n 

For  n  >  one  finds 

(20)  £>,*„_,=  0. 

,=o 

Equation  (20)  is  identical  with  Equation  (12),  and  Equation  (18)  and  (19)  can  be  combined  to 
give  Equation  (13).  This  proves  Equations  (12)  and  (13). 

From  the  above  discussion,  the  significance  of  the  a,  becomes  clear.  1  (e)  can  be  factor¬ 
ized  into  two  factors  .-Me)  and  fl(;),  and  the  a,  are  merely  the  coefficients  of  A  (cl,  provided 
a,,  is  defined  as  -1.  The  problem  is  thus  reduced  to  finding  4(:).  The  most  convenient 
method  to  do  this  seems  to  be  the  following  one.  One  starts  with  the  identity,  which  will  be 
proven  in  Section  6. 

a,  =  (',+  £  e_,  a,  ,.  1  <  ,/  <  Ih 

i- 1 

do,,  «  a,  .  I  </'</; 

a,*!.,  =  a,. i a,  +  «u+i.  1  <  ;  <  g  ~  1 .  I  <  ./  <  h 

Furthermore.  au  is  to  be  taken  as  zero  fo.  j  >  h.  Equations  (21)  and  (22)  can  be  used  to  find 

a,  by  successive  approximation.  As  starting  values,  one  can  use  a,"  =  t>(,  ./'  =  1,2 .  h. 

These  values  can  be  used  to  replace  the  a ,  on  the  right  of  (22).  and  a  new  approximation  for  a, 
can  be  found  from  Equation  (21).  In  this  way,  one  continues  uniil  a  suitable  stopping  criterion 
is  satisfied. 

We  tried  two  different  stopping  criteria.  First,  we  required  that  the  n,  generated  by  Equa¬ 
tion  02)  satisfy  the  queueing  equation  with  a  specified  precision  e.  It  was  found  that  for  high 
values  of  h  and/or  #,  this  criterion  performed  poorly.  The  reason  for  this  is  simple.  If  h  is  say 
100.  a  small  change  in  d |IHI  will  have  a  dramatic  impact  on  p  and,  consequently,  on  the  perfor¬ 
mance  measures  of  the  system.  As  an  alternate  stopping  criterion,  we  used  the  change  of  the 
calculated  /f'(l)  =  I  ja/  to  decide  when  the  precision  of  the  a,  is  adequate.  Since  /f'(l>  is 
closely  related  to  the  expected  number  in  the  system  (see  Equation  (17))  this  seems  to  be  a 
good  stopping  criterion. 

If  there  are  q  iterations,  one  needs  Aqgh  operations  to  find  all  ar  The  reader  may  want  to 
verify  this.  If  one  adds  all  e_,a,,  to  the  sum  at  the  right  of  Equation  (21)  as  soon  as  the  a,, 
becomes  available  one  only  needs  to  store  the  a, ,  for  the  current  value  of  i.  If  this  is  done,  the 
algorithm  requires  only  an  array  area  of  4 h  +  gas  the  reader  may  verify.  The  algorithm  is  thus 
efficient  and  does  not  require  huge  arrays. 

The  algorithm  described  above  was  programmed  in  FORTRAN  and  run  on  the  DEC 
2060.  The  execution  times  were  negligible.  A  problem  with  h  -  100  and  g  -  100  and  p  =  0.9 


(21) 

with 

(22) 
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< »nU  required  8  4  seconds  and  53  iterations  until  all  a,  were  obtained.  At  this  point  there  was  a 
relative  change  of  4  '<  I )  of  less  than  0.0001  between  two  iterations.  The  computer  time  to  find 
i he  *i  lor  e  =  100  and  It  =  5  is  0.2307  seconds  (19  iterations).  The  method  performs  thus 
v ci v  well 

5.  ALTERNATE  METHODS 

In  gain  a  cheek  for  our  results,  and  also  to  get  further  insight  into  the  problem,  we 
decided  to  trv  other  methods  as  well.  These  methods  are  based  on  C(z)  =  —  1),  that 

/is  1  i ,-i  was  deflated  by  z  —  I  l(z)  can  be  written  as 

Ur)  =  4  (zlfl(z). 

where  fit:)  is  equal  to  5<z)/(z  —  1)  This  deflation  decreases  the  degree  of  the  polynomials 
one  works  with  by  one.  and  it  also  increases  the  difference  between  the  zeros  of  A(z)  and 
fit:),  improving  thus  the  efficiency  of  the  algorithms  to  be  discussed.  A  further  deflation  by 
the  onlv  positive  /ero  of  .4  (z)  is  possible. 

To  factorize  l<z).  two  methods  were  used.  The  first  one  is  given  in  (1,  page  158],  The 
second  one  used  the  fact  that  E<z)  can  be  interpreted  as  the  characteristic  polynomial  of  a 
difference  equation  If  these  difference  equations  are  used  to  calculate  values  x„,  ar.+j,  xtli.2 
recursively,  the  zeros  greater  than  one  will  dominate,  and  eventually,  the  effect  of  B(z )  will 
become  negligible  The  ,v„  are  thus  almost  identical  to  a  series  generated  by  a  difference  equa¬ 
tion  that  has  4(c)  as  characteristic  polynomial.  In  other  words,  the  ,v„  will  almost  satisfy  the 
following  relationship 

h 

(23)  ,v„  =  £  a,  x„+r 

i-t 

If  x„  is  known  for  n  -  k.  k  +  I . k  +  2h  -  \,  this  gives  h  equations  for  the  h  unknown 

«/• 


The  x„  will  satisfy  Equation  (23)  precisely  if  the  initial  conditions  Xq,  X| . xA_i  satisfy 

(23).  Consequently,  one  can  repeal  the  above  algorithm  several  times  to  gain  a  higher  preci¬ 
sion  for  the  ar  In  each  repetition,  one  uses  the  a ,  obtained  in  the  previous  iteration  in  order 
to  calculate  the  starting  values  needed.  In  the  first  iteration,  one  can  use  a,n  —  dr  This  algo¬ 
rithm  gave  good  results  for  most  cases  we  tried 

The  disadvantage  of  the  two  algorithms  just  mentioned  is  that  they  require  the  solution  of 
h  equations  in  h  unknowns  during  each  iteration,  and  this  requires  h}  operations.  This  means 
that  they  are  inefficient  as  compared  to  the  algorithm  suggested  in  the  previous  section,  at  least 
if  h  is  high. 

Of  course,  A  (z)  can  also  be  found,  provided  one  knows  all  the  zeros  of  E(z)  outside  the 

unit  circle.  Let  Z\.Z2.Z) . zh  be  these  zeros-  A  (z)  becomes,  given  one  uses  the  fact  that 

a  o  -  -1 

A  (z)  =  — (z  —  Z\)  (z  —  Z2)  ...  (z  —  Zh)/[(—Z\)  (— Z2)  ■  ■  ■  —  (— 2*)] 

—  —  (1  —  z/z\)  (1  —  z/z2)  ■■  ■  (1  —  z/zh). 

The  factors  can  now  be  multiplied  in  the  usual  way,  (see  e.g.  [4])  giving 
A  (z)  —  -1  +  a,  z  +  a2z2  +  . . .  +  ahzh. 
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Note  that  this  procedure  even  works  for  multiple  roots,  provided,  of  course,  all  are  known. 

A  final  comment  shall  now  be  made.  Suppose  one  has  the  h  +  g  probabilities  irA  +  l. 

rrA+: . 7r Then,  one  can  theoretically  calculate  nm,  m  >  k  +  h  +  g,  recursively  by 

using  Equation  (2),  that  is 

_ 1_  ± 

ft  n+K  ft  ti  2*  ^7  ft  n- A 

‘-l’  A  —  e-H 

This  recursion  is  numerically  unstable,  that  is,  the  round-off  errors  will  increase  with  each  itera¬ 
tion.  The  characteristic  equation  of  this  difference  equation  is  fll/.vl  =  ,4(l/.v>  B(  l/.v).  and 
any  recursion  based  on  it  will  eventually  be  dominated  by  S(l/.v).  (Indeed,  this  very  effect 
was  used  earlier  to  find  A(:)s  Instead  of  doing  such  a  recursion,  it  is  better  to  use  2h  subse¬ 
quent  it „,  and  use  (12)  to  obtain  It  equations  for  the  a,,  j  =  1.2 . h. 


6.  THE  SIGNIFICANCE  OF  THE  a,  , 

This  section  proves  Equations  (21)  and  (22)  and  establishes  the  relationship  of  our 
method  with  the  method  of  Neuts.  We  start  to  prove  the  following  equation,  in  which  the  a, , 
are  calculated  as  given  by  (22) 

h 

(24)  ft „  +  i  =  X  a,.,ft„-i  •  /  >  0.  n  >  r 

./-i 

Since  a» ,  =  ar  this  equation  is  certainly  correct  for  /  =  0.  Moreover,  if  it  is  true  for  i,  it  is 


4-  1 .  One  has,  if  <7,/l+|  =  0 

h  +  \ 

ftn+,+\  =  X 
/-I 

au7r  n  +  \-i  ~  ai.\7r  n 

"  1 

h  h 

X  aiftn-<  +  X  a" 

/=  1  /-I 

h 

-  I 

k.  I«;  +  0,.,  +  Mn-i 

Because  of  (22),  this  gives 


ft  n  + 1  +  I  X  ^/  +  1 .  /  ft  n—  r 

I- 1 

This  proves  that  Equation  (24)  is  correct  for  /  +  I,  given  it  holds  for  i.  To  prove  Equation 
(21).  one  rewrites  the  queueing  Equation  (2)  as 

1  h 

tr,=  £  e_,7 r„+,  +  £  e,i r„_r 

,-T 

Using  Equations  (12)  and  (24)  this  gives 


h 


X  a’ft «-i 

I- 1 


+  X e-<  Z  a'in'' 

,-l 
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Consequently, 

V 

a,  =  e,  +  £  e_,al  r 

i- 1 

and  Equation  (21)  is  proven. 

Equation  (24)  can  also  be  used  to  simplify  the  initial  conditions.  Consider  for  instance 
Equation  (4)  which  can  be  written  as 

oo  r—  1  °° 

ft  n  ft  m  Pm.  n  21  ^ n  X  ^ 'r  +  w.  n  ft  r  +  m  •  ^  ^ 

m»0  m*0  w*“0 

If  nr+m  is  replaced  by  (24),  one  obtains  after  some  calculation 

r—  /»—  I  r- 1  oo 

ft n  ft mPm.n  £  ft  m^Pm.n  Pr  +  k.n  ^kr-m^'  0  ^  H  ^ 

w«*0  nt-r—fi 

Together  with  Equation  (14),  this  provides  a  set  of  /-independent  equations  in  the  /-unknowns 

”■(!•  77  1 . W,_|. 

We  now  compare  our  results  with  the  ones  obtained  by  Neuts.  [3,41  Neuts  sets  up  the 
following  matrix  equation  for  the  unknown  matrix  R. 

(25)  R  =  £  R"A„. 

The  A„  are  problem-dependent  matrices.  In  our  case,  they  become 

A„  —  [a,",!.  >i  =  0, 1.2 . (,e/ // 1  +  i 

Here,  (#/)/l  is  the  lowest  integer  above  #//;.  and 

■^i.i  ~  C/i- 1  +  i-nlr 

All  other  A„  are  zero.  As  before,  all  ek  outside  the  range  -g  <  k  <  It  are  assumed  to  be  zero. 
If  one  defines 

‘7,k  ~  f77" A/i+  I  •  7rt/i  +  2 . 7r*/i  +  /J- 

one  has  according  to  Neuts 
t rA  +  l  =  nkR. 

If  R  =  [/•,,),  this  gives 

h 

+  +  /  3“  ^7Tkh+jri.i 


h 

"^n  +  i  =  ^  fl  +  t- fOi+l- M' 

/-I 

When  this  equation  is  compared  with  (24),  one  finds 

au  =  ,'/l  +  l-/.l-H  • 

In  particular, 

a0.i  ”  Oj  ”  + 1  —  /.  I  • 
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The  at  are  thus  contained  in  ihe  first  column  of'  R  We  also  noie  that  liquation  (21)  follows 
from  Equation  (25).  as  the  reader  may  verify. 

To  calculate  R  according  to  (25)  one  has  to  find  the  terms  R"A„.  n  =  2.3 . ig/ h  I  +  I 

and  this  can  be  done  in  2  [#//>]  matrix  multiplications.  However,  each  matrix  multiplication 
requires  2 h‘  operations,  which  means  that  there  are  4  ti'lg/hl.  or  approximately  4 /re  opera¬ 
tions  per  iteration  to  find  R .  not  counting  the  matrix  addition  Thus,  by  taking  advantage  of 
the  special  structure  of  the  problem  one  obtains  a  more  efficient  algorithm,  namely  the  one 
described  in  this  paper. 

The  equivalence  of  the  a ,  with  rh+ 1_,|  is  of  great  importance.  In  particular.  Neuts  ( 2 1 
proved  that  all  r, ,  are  nonnegative.  Consequently,  all  a,  are  nonnegative  Neuts  also  finds  that 
the  r, ,  have  a  probabilistic  interpretation.  Indeed,  the  r,  ,  are  the  expected  number  of  visits  to 

state  n  +  H  +  i  starting  at  n  +  I  without  hitting  In  +  I . n  +  i . n  +  /;).  This  means 

that  a,  gives  the  expected  number  of  visits  to  state  n  +  /  starting  at  ii  without  hitting 
I «+./—/» . n  +  /  -  1 1 . 

7.  CONCLUSIONS 

This  paper  derives  an  extremely  ellicient  algorithm  which  can  be  used  to  numerically 
determine  the  slate  probabilities  of  many  queueing  problems  efficiently  and  precisely.  This 
algorithm  was  obtained  a  by  combining  the  classical  approach  with  the  approach  of  Neuts. 
Indeed  it  bridges  the  gap  between  these  two  approaches  and  opens  new  horizons  for  further 
research. 
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ABSTRACT 

This  paper  examines  the  process  by  which  a  user  of  a  queueing  system 
selects  his  arrival  time  to  the  system  to  compensate  for  unpredictable  delays  in 
the  system  if  he  wishes  to  complete  service  at  a  particular  time  Considering 
the  case  in  which  all  the  system  users  have  already  decided  on  their  arrival 
limes  to  the  system  and  will  not  change  these  times,  this  paper  investigates  how 
a  new  user  of  this  system  develops  his  strategy  for  selecting  his  arrival  time 
The  distribution  of  this  customer's  arrival  time  is  then  obtained  for  a  special 
case. 


1.  INTRODUCTION 

If  a  user  of  a  queueing  system  can  choose  when  to  enter  the  system,  then  he  would  prob¬ 
ably  choose  to  enter  when  the  delays  he  experiences  will  be  minimum.  However,  if  he  wishes 
to  complete  receiving  service  at  a  particular  time,  called  the  "target  time,"  then  he  would  have 
to  arrive  at  the  system  early  enough  to  allow  for  his  service  time  and  unpredictable  delays  in 
the  system.  This  allowance  was  termed  "headstart"  by  Gaver  [31  and  the  same  terminology  will 
be  adopted  here.  The  strategy  by  which  a  user  selects  his  headstart  is  studied  in  this  paper. 

Consider  the  case  in  which  all  the  system  users  already  have  fixed  headstart  strategies  and 
will  not  change  these  strategies  under  any  condition.  This  paper  investigates  the  case  in  which 
a  new  user  who  wishes  to  use  the  system  develops  a  headstart  strategy.  This  will  be  done  by 
assuming  that  this  user,  whom  we  name  U  (cf.  Alfa  and  Minh  [2]),  attaches  some  perceived 
costs  to  delays  and  to  early  and  late  completion  of  service,  and  that  he  selects  his  headstart  in 
order  to  minimize  his  total  cost.  Gaver  [3]  obtained  the  "best"  headstart  strategy  for  this  user, 
in  this  type  of  situation,  on  the  assumption  that  the  user  wishes  to  minimize  his  expected  total 
cost.  This  paper  proposes  the  headstart  strategy  usually  adopted  by  such  a  user,  on  the  assump¬ 
tion  that  he  selects  his  headstart  each  day  in  order  to  minimize  the  total  cost  he  incurs.  It  is 
assumed  that  on  the  first  day  he  uses  the  system  he  does  not  know  what  the  delay  distribution 
is  and  therefore  selects  his  headstart  arbitrarily,  and  hence  probably  incurs  high  total  cost. 
However,  for  the  following  day,  he  uses  his  knowledge  of  the  previous  day's  outcome  to  try 
and  reduce  his  total  cost;  and  this  continues  until  a  stage  is  reached  in  the  long-run  when  he 
settles  for  a  particular  headstart  and  does  not  change  any  further.  Let  us  call  this  stage 
"steady-state."  This  headstart  reflects  his  arrival  time  at  the  system  and  our  interest  is  in  the 
distribution  of  his  arrival  lime  at  the  system  at  steady-state. 
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This  paper  does  noi  restrict  its  analysis  to  nonrush  hour  traffic  or  linear  cost  parameters  as 
done  hy  Gavcr  13]. 

2.  PROBLEM  FORMULATION 

2.1  Assumptions  and  Definitions 

Consider  a  queueing  system  used  by  a  finite  number  1,(1  >  0),  of  customers  who  behave 
identically  and  independently.  For  each  day.  let  us  observe  the  system  at  equally  spaced  epochs 
sequentially  numbered  0, 1,2,3....  and  assume  that  all  the  customers  arrive  at  and  depart  from 
the  system  only  at  instants  immediately  prior  to  these  epochs. 

It  will  be  assumed  that  once  a  customer  is  served  he  does  not  return  to  the  pool  of  poten¬ 
tial  customers.  This  assumption  and  that  about  the  number  of  customers  being  finite  are  not 
necessary  for  the  analysis  but  they  allow  us  to  use  the  results  from  Alfa  [1]  and  Minh  [4). 

Assume  that  each  customer,  considered  separately,  has  the  time-dependent  probability  A„ 
of  arriving  at  the  system  prior  to  epoch  n  +  1,  given  that  he  has  not  arrived  at  the  system  by 

epoch  n  ( //  =  0, 1 ,2.3 _ ) .  Assume  that  the  service  times  of  the  successive  customers  are 

independent,  identically  distributed  random  variable  S  in  the  set  (1.2 . M\:  M  <  °o. 

Suppose  the  new  customer.  U  who  also  wishes  to  use  this  system,  chooses  to  arrive  at  the 
system  at  epoch  a  =  n.  Let  the  delay  he  experiences  by  so  doing  be  W".  The  distribution  of 
W"  can  be  obtained  as  shown  in  Alfa  (11.  Note  that  the  addition  of  this  customer  U  to  the  sys¬ 
tem  has  raised  the  total  number  of  customers  to  /  +  I.  Although  the  distribution  of  ir s  arrival 
time  is  not  necessarily  the  same  as  that  of  the  other  /  customers,  the  results  in  Alfa  [1]  can  still 
be  applied  keeping  in  mind  that  W"  is  a  conditional  random  variable.  The  distribution  of  If  s 
arrival  time  shall  be  developed  in  this  paper. 

Let  V"  be  the  time  spent  in  the  system  by  U.  given  that  he  entered  the  system  at  epoch  n, 
then  V"  =  W"  +  S.  Let  F"  A  Pr|V"  =  /}. 

For  practical  purposes  it  will  be  assumed  that  U  shall  arrive  at  the  system  only  at  an  epoch 
between  I  and  ,V;  where  1  <  ,V  <  °o. 

Suppose  U  wishes  to  complete  service  at  epoch  r,  where  for  convenience  1  <  r  $  ,V.  If 
he  arrived  at  the  system  at  epoch  a  =  ;/,  got  delayed  W"  =  i  units  of  time  and  took  S  =  /  units 
of  time  for  service  then  he  would  complete  service  at  epoch  n  +  i  +  ./.  One  of  his  objectives  is 
for  n  4-  /  +  j  to  equal  t.  However,  if  /?  +  /  +  /<  r  or  «  +  /'  +  ./>  r  then  he  is  early  by  an 
amount  r  -  n  —  /  -  /  or  he  is  late  by  an  amount  n  +  /  +  j  -  t.  respectively.  He  attaches  a 
cost  to  either  of  the  outcomes  and  these  costs  are  not  necessarily  the  same.  In  addition,  he 
attaches  a  cost  to  the  time  spent  in  the  system  i  +  j.  He  would,  therefore,  incur  a  total  cost 
Cin,  i  +  j),  where  CO/,  /  +  ./)  is  the  sum  of  the  cost  of  time  spent  in  the  system  and  the  cost 
for  either  earliness  or  lateness,  depending  on  which  is  the  outcome,  keeping  in  mind  that  if 
n  +  /+./'  =  r  then  he  only  incurs  the  cost  for  his  time  in  the  system. 

Let  C„  be  the  total  cost  incurred  by  U  given  that  he  arrived  at  the  system  at  epoch  n 

where  C„  can  only  assume  the  values  in  the  set  {C(n,/)U—  1.2 . (/+  II  x  M\.  For 

brevity,  C„  will  be  termed  the  cost  associated  with  epoch  n. 
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Let  us  define  the 

following  se; 

ts  that  will  be  used  in  the  next  subsection 

til 

G  A  1  / 

o  =  1.2.  ... 

.  .  t  /  +  1 1  x  M\ 

lii) 

L  A  |/| 

\i  =  1.2.3. 

. \| 

liiil 

L,  A  (/ 

1/  =  1.2.3. 

. . .  .  n  —  1 .  n  +  1 . V } 

lie) 

If  A  / 

n  l.„, 

If  there  are  v  epochs.  (I  <  v  57  V  -  2),  (.V  >  2).  labelled  tnt.  nu . mv  for  which 

Cm  =  Cm|  =  Cm,  =  . .  .  =  C„;i  where  Vm,  €  L"r  r  =  1.2 . v.  then  let 

(v>  L'f  A  {ill  =  in | .  in 2 . w.:  Vm,  €  /  =  1.2 . v|. 

Let 

(vi)  L " ^  A  -  L,"  i.e.  L "  v  is  the  set  difference  of  Lm  and  L” 

2.2  The  Model 

2.2.J  The  General  Model 

Suppose  on  the  t/"1  day.  (d  =  1,2,  ...).  U  arrived  at  the  system  at  epoch  aJ=  n.  Let 

a'J  A  Pr{a‘/  =  n|,  n  €  L.  U  would  incur  a  total  cost  C„.  and  by  definition 

(1)  Pr{C„  =  Cin.  /)}  =  V”. 

We  assume  that  If  s  choice  of  arrival  time  epoch  for  one  day  depends  entirely  on  the  out¬ 
come  of  the  previous  day's  choice  of  arrival  time  i.e.,  on  the  total  cost  incurred.  V  wishes  to 

minimize  his  total  cost.  Hence.  arf+l  is  a  decision  variable  with  an  action  space  L.  and  a*7*1 
depends  on  the  cost  incurred  on  day  d.  Given  the  action  on  the  <fh  day  the  action  for  the 
(d  +  1  ),h  day,  a‘/+1,  is  thus  a  Markov  Chain. 

Let  tnm  A  Pr(a‘y+I  =  mla1'  =  n]  be  the  transition  probabilities  of  the  chain  These  transi¬ 
tion  probabilities  define  U s  strategy. 

If  s  intention  is  to  choose  an  arrival  time  epoch  that  minimizes  his  total  cost  each  day  he 
uses  the  system,  hence  for  the  (d  +  l)lh  day,  his  strategy  for  choosing  his  arrival  time  epoch 
will  be 

(2)  -  Pr{C„  <  Cu;  V«  €  L„); 
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The  arguments  leading  to  (2>  and  (3)  above  are  as  follows: 

(a)  If  L  arrived  at  the  system  at  epoch  n  on  the  ifh  day  he  will  not  change  his  arrival 
time  to  any  other  epoch  on  the  id  +  l)lh  day  if  he  can  not  reduce  his  total  cost  by  so 
doing.  This  leads  to  (21 

(b)  If  L  arrived  at  the  system  at  epoch  won  the  </"’  day  he  will  change  his  arrival  time  to 

epoch  m  on  the  (d  +  \  )'h  day  if  this  could  reduce  his  total  cost,  prov  ided  there  is  no 
other  epoch  which  has  equal  potential  reduction  in  total  cost  as  epoch  m—  this  leads 
to  the  first  term  on  the  R.H.S  of  (3).  In  addition,  however,  if  there  are  v  other 
epochs,  other  than  n  and  nu  at  which  U  could  choose  to  arrive  at  the  system  and 
achieve  the  same  reduction  in  to’.:.!  cost  as  arriving  at  epoch  nu  then  L  would  arrive 
at  any  of  these  v  epochs  or  epoch  m  with  the  same  probability,  provided  the  total 
costs  to  be  incurred  at  each  of  these  epochs,  including  epoch  nu  is  les-.  than  at  any 
other  epoch.  This  leads  to  the  second  term  on  the  R.H.S.  of  (3)  (2)  and  (3)  above 

constitute  the  transition  probabilities  that  deline  L~ s  strategy 

Define  an  ,V  x  ,V  matrix  T  =  (/„,„)  and  an  ,V  vector  A,/  =  [ci‘‘.  a* . u{ ].  Then  L 

selects  his  (</  +  I)lh  day's  headstart  according  to 

(41  =  A1'  x  T. 

THEOREM  1:  £f,m=l. 

«i-i 

PROOF:  Consider  one  epoch  n.  in  €  L).  and  the  associated  cost  C„.  If  we  compare  this 
cost  C„  to  the  costs  associated  with  other  epochs  in  the  set  L„  then  C„  is  either  the  minimum 
cost,  one  of  the  equally  minimum  costs  or  neither  of  the  two.  This  can  be  stated  as 

<5>  Pr{C„  C„;  Vu  €  L„\  +  Pr|C„  <CU:  V«  €/.„}=!. 

Substituting  i„n  for  the  first  term  on  the  L.H.S.  of  (5)  gives 

<6)  tn,„  +  Pr{C„  <C„;  V//  €/.„)=!. 


The  second  term  on  the  L.H.S.  of  (6)  is  the  probability  of  having  at  least  one  epoch,  in 
the  set  L„,  whose  associated  cost  is  less  than  C„.  This  implies  that  there  is  at  least  one  epoch, 
other  than  n,  whose  associated  cost  is  minimum.  If  there  is  only  one  such  epoch  then  the  pro¬ 
bability  of  such  event  is  given  by 

£  Pr(Cm  <  C„  :  W  €  LJ. 

\mUn 


If.  however,  ihere  are  v  other  epochs  (I  ^  v  ^  S  -  2),  labelled  m Jt  m: . Vm,  6 

I  <  r  ^  v,  such  that  C,„  =  C„l(  =  CWl  =  . . .  =  C\7  ,  then  the  probability  of*  such  event  is  given 
by 


X-2 


I  I 

Vm€  l.n  v-l 


V  +  1 


P  r  { C  =  Cm  <  Cw:  Vm,  € 


L" 

\  * 
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The  second  term  on  the  L.H.S.  of  (6)  can  thus  be  written  as; 


(7) 


Pr{C„  «C,/.  Vh  €  L„ )  =  £  Pr{Cm-<  C„;  V»-  €  L„ 


Vm  €  L„ 


V- 2 


+  I  I  PrlC"  =  C«,  <  C-  Vw'  6  *•»»-.  v»'  €  T"  v ! 


Vm«.  v-l 


T  (v.  m  * 

Vw  €  Z.„ 

Substituting  this  into  (6)  gives 

Ui.n  y  Ci.  m  1 

Vw€  /.„ 

V 

i.e.,  X  anc*  l^‘s  Proves  Theorem  1. 

l 


REMARK:  If  r„„  =  I  for  any  n  €  L  then  n  is  an  absorbing  state  and  the  solution  to 
Equation  (4)  is  trivial.  However,  the  occurrence  of  such  a  situation  in  real  life  would  be  quite 
rare  — we  therefore  assume  for  our  present  problem  that  0  <  t„m  <  1,  Vin.m)  6  L 

LEMMA  1:  If  f,(|  =  0,  («,,«>)  €  L.  for  any  nt  ^  n 2  and  t„  tJ|  r*  1,  then: 

(a)  either  tn%  „2  =  0,  V«,  €  L,  and  n2  is  a  transient  state  which  can  be  deleted  from  the 

chain,  or 

(b)  there  exists  at  least  one  nA  ^  «,  such  that  r„4  >  0  and  therefore  state  n2  can  be 

reached  from  state  rt\  via  state  nA,  hence,  all  n  €  L  form  an  irreducible  markov  chain 

(Note  that  «4  €  L). 

PROOF.  Lemma  1(a)  is  a  compliment  of  Remark  above  and  does  not  require  a  proof. 
For  Lemma  Kb),  if  there  exists  «4  ^  nx,  such  that  in  >  0.  then  there  also  exists  at  least 

one  A’  tuple  (i\<h . /\)  such  that  C~(n4./4)  >  Cln,.i2)  ^  ( '(«,./,)  ^ 

C'(«|,/|)  S~  C(nk.ik ):  V'A-  >  4.  V/(  f  C,  I  <  j  ^  A. 

Further  let 

(vii)  Lq  A  \i\inj  =  0;  V«  €  L] 

(viii)  Ln  A  L  -  Ln 

(ix)  Ln  „  A  Ln  0  L„  and  let  Na  be  the  number  of  elements  in  L0. 

Define  an  A0  x  A0  matrix  T0  =  (r„m),  V(n,m)  €  L0 ,  and  also  define  an  A0  vector 
\ -  (u^);  €  L, Equation  (4)  can  now  be  stated  in  a  modified  form,  for  only  the  epochs 

untamed  in  L„.  as; 

‘ 1  A,'/  1  =  A(f  x  T0. 


I  MM  \  2  f„„  >  0,  \rt  €  Llh  hence  all  n  €  Ln  are  aperiodic  states. 


i  i  vtht  K  !'ih: 
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PROOF:  From  !he  cost  structure,  it  is  apparent  that  if  !„„  —  0  then  tmn  -=  0,  V/w  6  L, 
and  hence  n_9  Ln  i.e.,  state  n  would  have  been  deleted  from  the  chain  considered  in  equation 

(8)  (i.e.,  in  L0).  Therefore,  /„„  >  0,  V«  €  Z,0<  and  hence  Lemma  2  follows. 

THEOREM  2:  There  exists  a  limiting  probability  vector  A0  =  lim  Ao'such  that 

d~  °o 

(9)  A0  =  Aq  x  T0 

has  a  positive  solution  which  is  unique. 

PROOF:  The  necessary  and  sufficient  conditions  for  the  existence  of  a  positive  and 
unique  solution  A0  is  that  the  chain  should  be  irreducible  and  aperiodic.  Both  conditions  were 
established  by  Lemma  1(b)  and  Lemma  2  respectively. 

The  interest  of  this  paper  is  in  LT s  choice  of  arrival  time  in  the  long  run,  A0,  which  can 
now  be  solved  for  in  Equation  (9). 

2.2.2.  A  Special  Case 

In  the  general  model  it  was  assumed  that  if  U  arrived  at  epoch  n  on  the  cfh  day,  he  would 
not  change  his  arrival  time  to  any  other  epoch  if  C„  <  Cm,  Vm  6  L„.  Suppose  we  modify  this 
assumption  and  now  assume  that  when  C„  =  Cm  for  any  m  ^  n  we  would  not  rule  out  the  pos¬ 
sibility  of  V  changing  his  arrival  time  to  epoch  m.  In  that  case  we  shall  attach  equal  chances  of 
U  changing  his  arrv;.;  time  to  epoch  m  and  to  him  not  changing  his  arrival  time  from  epoch  n. 
Thus,  l/s  strategy  will  now  be  modified  such  that  the  transition  probabilities  that  define  his 
strategy  will  be  given  by: 

(3a)  tn  m  -  Pr{Cm  <  Cu ;  V«  €  Lm)  + 

£  — XT  Pr'C"'  =  C"c  <  c«;  Wm'  e  L"": 

v-l  v  1 

Vw  6  Lm  J:  Vn,m  6  L. 


where 

(x) 

Lm.v  A  1/1/  - 

. . .  ,  mv)  such  that 

cm  =  cm  =  cm  =  . . 

m  m  | 

.  =  Cm;,  1  <  v  <  /V 

Lm  v  c  L  :  and 

(xi) 

L-m,  v  A  —  v- 

It  is  immediately  apparent  that  the  right  hand  side  of  Equation  (3a)  is  independent  of  n, 
hence  tnm  =  /vm;  Vin.m.v )  €  L.  Let  tm  A  tnm\  V(n,m)  €  L.  For  this  special  case,  therefore, 
all  the  rows  of  T  are  identical,  and  the  steady  state  solution  is  given  by 

|  <n  1  €  L0 

(10)  a"  =  |  0  \n  6  L,). 
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2.2.3.  Illustrative  Example 


For  illustrative  purpose  let, 

/  =  2.  N  =  3.  r  =  3.  M  =  1. 
A„  =  0.3,  A |  =  0.5,  A,  =  1.0, 


Let; 


(ID 


C  (/>,/) 


Q  x  </)  + 


(  x  (r  —  n  —  i)  . 
C'/  X  (/;  +  /—  r  ) 


and  let  C,  =  1.0.  C,  =  1.0,  C,  -  2.0. 


n  +  i  ^  t 
n  +  i  s?  t 


By  using  results  front  Minh  [4]  we  obtain  the  distribution  of  queue  lengths  and  from  Alfa 
[1],  the  distribution  of  waiting  is  obtained  as 


(V," 


.  *  1 

1 

2 

3 

0 

0.73 

0.60 

0.57 

1 

0.24 

0.36 

0.39 

2 

0.03 

0.04 

0.04 

Flence,  F"  “  W^Li,  /  =  1,2,3. 

The  total  cost  C(n,i)  are  given  as 


C(n.i) 


n 

> 

2  1 

3 

1 

3 

2 

3 

2 

4 

5 

6 

3 

7 

8 

9 

The  transition  probabilities  are  thus  given  as 


(12) 

'u  =  vl  ( v2  +  Yj]+  [  Ff  +  F2j  (('j  + 

Fj’l  +  Fj*  Ff  F2, 

(13) 

'..2=  F,2  +  Ff  F,'  [  Fj'  +  F/l. 

(14) 

fi.i-  F,’  [  F2'  +  F,1)  [F22  +  F2]  +  F/Fj'F/, 

(15) 

,  =  F,1  [  F22  +  F32J  IF2  +  Fj  )  +  Ff  [F22  + 

F2  ]  [  Fj'  +  Fj3  ] 

+  F/  F,2  F,’  +  F/ [F22  +  F,2  1  F,V2, 

(16) 

,2  2=  F,2  +  Ff  Fj  [  F2  +  Fj']. 

(17) 

r2..,=  F3F2'  [Ff  +  F2 !  +  [  Ff  4-  Fj']  F,2  F,1 

+  F,'  F,1  [  Ff  +  I 
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(18)  =  [l,1  +  v\  ]  [  v}  +  k,2]f33  +  [i/,1  +  r,'][K22  +  y?  ]  yj> 

+  V\VlV l 

(14)  l}  2  =  v r  +  yj  y]  I  y23  +  u,3], 

(20)  i7,’  [u?  +  Kj]  +  y{  I  +  r,’  )■{  y- . 

Thus,  this  leads  to  the  transition  matrix, 

0.333  0.605  0.062 

(21)  T  =  0.256  0.605  0.139 

0.167  0.605  0.228 

from  which  we  obtain 

(22)  A  =[0.265,  0.605,  0.130], 

Note  that  for  this  example  L0  =  L,  hence  T0  =  T  and  A0  =  A. 

3.  CONCLUSION 

This  model,  based  on  its  assumptions,  can  only  be  used  to  predict  the  customer’s  arrival 
lime  at  the  system  when  all  other  customers  previously  using  the  system  have  settled  for  a  par¬ 
ticular  arrival  time  and  will  not  change  it  under  any  circumstances.  However,  when  all  the 
other  customers  change  their  arrival  times  as  a  result  of  the  new  customer's  "interference"  then 
there  will  be  a  slight  modification  to  the  problem. 

If  other  customers  change  their  arrival  times  everyday,  then  the  distribution  of  W"  will 
change  from  day  to  day  and,  hence,  T  and  T0  will  also  change  from  day  to  day.  If  we  let  ld  or 
T0rf  represent  the  transition  matrix  for  the  strategy  for  the  (d  +  l)th  day  then  Ifs  choice  of 
arrival  time  can  be  reformulated  as 

(23)  Arf+I  =  Ad  x  Td. 

However,  with  (23)  existence  of  a  steady  state  solution  cannot  be  assured.  If  we  further  let 
there  be  more  than  one  server  in  parallel  such  that  a  customer  develops  a  strategy  not  just  for 
selecting  his  arrival  time  only  but  for  joint  selection  of  arrival  time  and  of  the  server  to  serve 
him,  then  the  problem  becomes  similar  to  that  in  Alfa  [2], 
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ABSTRACT 

This  paper  develops  a  forward  algorithm  and  planning  horizon  procedures 
for  an  important  machine  replacement  model  where  it  is  assumed  that  the  tech¬ 
nological  environment  is  improving  over  time  and  that  the  machine-in-use  can 
be  replaced  by  any  of  the  several  different  kinds  of  machines  available  at  that 
time  The  set  of  replacement  alternatives  may  include  (i)  new  machines  with 
different  types  of  technologies  such  as  labor-  and  capital-  intensive,  (ii)  used 
machines.  fiti)  repairs  and/or  improvements  which  affect  the  performance 
characteristics  of  the  existing  machine,  and  so  forth 

The  forward  dynamic  programming  algorithm  in  the  paper  can  be  used  to 
solve  a  finite  horizon  problem  The  planning  horizon  results  give  a  procedure 
to  identify  the  forecast  horizon  T  such  that  the  optimal  replacement  decision 
for  the  first  machine  based  on  the  forecast  of  machine  technology  until  period 
T  remains  optimal  for  any  problem  with  horizon  longer  than  T  and.  for  that 
matter,  for  the  infinite  horizon  problem.  A  flow  chart  and  a  numerical  exam¬ 
ple  have  been  included  to  illustrate  the  algorithm 


1.  INTRODUCTION 

In  our  previous  paper  [5l,  we  developed  forward  algorithm  and  planning  horizon  pro¬ 
cedures  for  a  machine  replacement  model  under  the  assumption  that  only  one  kind  of  machine 
is  available  for  replacement  in  any  given  period.  We  showed  that  there  exists  a  forecast  horizon 
T,  such  that  the  optimal  replacement  decision  for  the  first  period  (based  on  the  forecast  of 
machine  technology  until  period  T  remains  optimal  for  any  longer  (than  T)  horizon  and,  for 
that  matter,  the  infinite  horizon. 

In  this  paper,  we  relax  the  assumption  of  a  single  possible  replacement  alternative  by  mul¬ 
tiple  alternatives.  That  is,  we  develop  a  model  in  which  the  machine-in-use  can  be  replaced  by 
any  of  the  several  different  kinds  of  machines  available  at  that  time.  This  model  can  deal  with 
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many  realistic  situations.  For  example,  the  set  of  replacement  alternatives  may  include  (i>  new 
machines  with  different  types  of  technologies  such  as  labor-  and  capital-  intensive,  (ii)  used 
machines,  (iii)  repairs  and/or  improvements  which  affect  the  performance  characteristics  of  the 
existing  machine,  and  so  forth.  It  should  be  noted  that  the  set  of  replacement  alternatives  can 
change  from  period  to  period. 

This  multiple  alternative  replacement  model  represents  a  major  generalization  of  the  vari¬ 
ous  single  alternative  replacement  models  developed  by  Terborgh  [71,  Thompson  (81.  Sethi  and 
Morton  [6|,  Gordon  (1).  For  a  survey  of  the  extensive  literature  on  the  machine  replacement 
models,  the  reader  is  directed  to  Rapp  [41;  see  also  Pierskalla  and  Voelker  (3). 

In  the  next  section,  we  formulate  the  multiple  alternative  replacement  model  under 
improving  technologies.  In  section  3,  we  develop  an  efficient  forward  algorithm  for  solving 
finite  horizon  problems.  Section  4  develops  a  variant  of  the  regeneration-  monotonicity  pro¬ 
perty  [2,  5]  and  planning  horizon  procedures  for  the  model.  A  flow  chart  summarizes  the  com¬ 
plete  solution  procedure.  A  numerical  example  is  solved  in  Section  5  to  illustrate  the  steps  of 
the  forward  algorithm  and  the  application  of  the  planning  horizon  procedure.  We  weaken  the 
assumption  of  improving  technologies  in  Section  6  and  show  that  planning  horizon  procedures 
can  be  adapted  to  this  case.  Section  7  concludes  the  paper  with  some  important  remarks. 

2.  MODEL  FORMULATION 

Consider  the  situation  of  a  production  shop  which  must  keep  a  single  machine  of  a  partic¬ 
ular  capacity  at  all  times  To  run  this  machine,  the  shop  incurs  operating  expenses  which  may 
include  labor  cost,  electricity  cost,  maintenance  cost,  depreciation,  and  so  forth.  Usually,  the 
performance  of  the  machine  deteriorates,  i.e.,  operating  expenses  increase  over  time,  so  that 
the  shop  might  consider  selling  the  existing  machine  for  its  salvage  value  and  buying  a  new 
one.  This  new  machine  is  selected  from  various  different  alternative  machines  available  If  a 
major  repair  is  decided  on  the  existing  machine,  then  this  alternative  can  be  considered  as  if  a 
new  machine  of  another  technology  is  purchased  at  the  cost  of  repair  plus  the  salvage  value  of 
the  existing  machine.  Once  the  new  machine  is  purchased,  the  same  situation  repeats  with  the 
new  machine.  Consequently,  for  any  finite  or  infinite  horizon,  the  shop  will  make  a  chain  of 
replacement  decisions  [4],  (51.  The  problem  of  the  shop,  of  course,  is  to  find  simultaneously 
the  optimal  times  of  these  replacements  and  the  types  of  machines  selected  at  these  times. 
Obviously,  these  decisions  will  depend  on  the  prices  of  the  future  machines  and  their  per 
period  operating  expenses  over  time.  More  precisely,  we  are  considering  the  following  model 
which  the  shop  must  solve: 

Let  Ai  denote  the  machine  of  technology  h  available  at  time  t.  Let  there  be  machines  of 
N  alternative  technologies  available  to  choose  from  in  any  period.  It  is.  noted  that  the  extension 
to  the  case,  when  the  set  of  alternative  technologies  is  changing  over  time,  is  straightforward 
and  will  be  described  in  Section  7. 

Let  0,hk,  k  ^  r,  be  the  operating  cost  of  the  machine  A,h  in  period  k.  With  reference  to 
our  previous  paper  [51,  we  note  that 

0,h,  =  t it,”  -  Si,  +  MU, 
and 

o,\  -  -  s*k  +  M*k 
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where  it /'  is  ihe  purchase  price  (il  is  assumed  thai  the  new  machine  Aj'  is  purchased  ai  the 
beginning  of  period  /)  and  S,''k  is  the  salvage  value  at  the  end  of  period  A  of  the  machine  A 
T  he  term  A/,\  denotes  all  the  costs,  which  are  specific  to  the  technology  h,  of  operating  the 
machine  A',’  in  period  A  excluding  the  cost  of  depreciation  (or  loss  in  salvage  value  from  period 
A  -  I  to  A). 

To  obtain  a  forward  algorithm,  we  must  solve  the  multiple  alternative  replacement  model 
for  any  given  finite  horizon  P.  Suppose  the  shop  uses  n  machines  during  the  interval  <1  ,  P>. 

Let  h . i„-  |,  t„  be  their  salvage  times  and  let  h\,  h2 . h„  be  their  technologies. 

Note  that  r,  ^  1  for  i  =  1 ,  2,  . . .  ,  (n  —  1 )  and  t„  =  7",  since  we  assume  that  the  shop  goes  out 
of  business  at  the  end  of  the  given  finite  horizon  P.  Define  to  =  0.  Also  note  that 

<p,q>  =  [p,  p  +  I . q\  with  nonnegative  integers  p  and  q  with  q  >  p.  We  can  now  state 

the  finite  horizon  problem  as 

n  -  I  li  *  I  . 

(U  Minimize  £  £  0,+,'* 

,,n 

n.  »i.  1 . . 1 

h\.  h2 . h„. 

In  the  next  section,  we  develop  a  forward  dynamic  programming  algorithm  to  solve  (1). 

3.  FORWARD  ALGORITHM 


We  start  with  defining  the  following  terms: 

Purchase  Point.  A  period  t  is  defined  to  be  a  purchase  point  (or  P-poini)  if  a  machine  is 
purchased  at  the  beginning  of  period  t. 

Regeneration  Point  A  period  t  is  a  regeneration  point  (or  R-point)  if  a  machine  is  sal¬ 
vaged  at  the  end  of  period  t. 

It  is  easy  to  see  that  any  P-point  is  immediately  preceded  by  an  P-poinl.  This  property 
allows  us  to  develop  an  efficient  forward  algorithm  [2]. 


We  need  to  introduce  the  following  notation: 


cm 

CHj.T) 


C*(  T) 


(3) 

Cj(T) 


=  minimum  cost  of  the  T-period  problem  (1), 

=  the  total  operating  cost  for  the  machine 
A*+ 1  in  the  interval  <j  +  I ,T> 

=  £  o/+u 

=  minimum  total  cost  of  the  P-period  problem  when  (./  +  1 )  is  the  last 
P-poinl  and  '/  is  the  technology  of  the  last  machine 

=  C(y)  +  Ch(J  T) 

=  minimum  total  cost  of  the  P-period  problem  with  0  +  D  as  the  last 
purchase  period, 
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Clearly, 

(4)  C,(D  =  min  CHT) 

h 

and 


(5)  C(7")  =  min  CAT) 

y(<O.T-l> 


Equations  (2)-(5)  complete  the  statement  of  the  forward  algorithm.  It  is  possible  to  sim¬ 
plify  the  computation  of  C*(D;  using  (2)  in  (3)  gives  the  following  sequential  procedure 


(6) 


c;(  T )  = 


Cf(r-  1)  +  o;+l.r  for  ;  <  7--  1 

c(T -  [)  +  o$T  y  «  r-  r 


4.  PLANNING  HORIZON  RESULTS 

To  obtain  planning  horizon  results,  we  need  to  assume  that  every  technology  is  an 
improving  technology.  Thus,  for  technology  A,  we  assume  that 

(7)  0,-1. a  >  Ol'k  for  k  >  j  +  \  . 

In  other  words,  we  assume  that  the  operating  cost  in  a  period  for  a  machine  of  technology  h  is 
lower  than  the  operating  cost  for  an  older  machine  of  the  same  technology,  except  perhaps  in 
the  first  period  of  operation  of  the  new  machine  when  the  amount  of  depreciation  on  the  new 
machine  is  usually  high. 

Lower  Bound  Monotonicity  Property: 

Let  j*(T)  denote  the  latest  next  to  the  last  /{-point  in  the  optimal  solution  of  the  T- 
period  subproblem;  note  that  period  T  is  the  last  /{-point.  By  the  tower  bound  monotonicity  pro¬ 
perty  we  mean  that  the  lower  bound  of  j*(T)  increases  monotonically  with  T.  To  prove  this 
property,  we  must  also  define  j*h(T)  to  be  the  latest  next  to  the  last  /{-point  for  the  optimal 
solution  of  the  Ch(  Tl-problem,  where  the  Ch(  D-problem  is  the  T-period  problem  subject  to 
the  constraint  that  the  last  machine  be  machine  of  technology  h. 

THEOREM  I:  Under  the  assumption  (7)  of  improving  technology  over  time,  j*h(T) 
satisfies  the  regeneration  monotonicity  property,  i.e., 

(8)  j*"(T  4-  1)  ^  j*h(T). 

PROOF;  Let  j*h(T  +  1)  —  a  and  y**(D  =  b.  It  is  easy  to  see  that  C^iT)  >  C$(T). 
Furthermore,  if  a  <  A  then  0^+ ,  r+,  ^  0*+ 1 . r+ 1 • 

If  a  <  A,  then  using  (6)  we  can  write 

c*(r+  i>  -  c*m  +  o*+,.r+l 
^  dm  +  o»\,.r+l 

-  CbHT  +  1). 
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This  contradicts  the  optimality  of  the  C*  (  T  +  1  (-solution,  implying  that  a  ^  b.  This  completes 
the  proof, 

THEOREM  2:  If  we  define 

(9)  X(D  «  min  {/**(D}, 

h 

then  X(D  represents  a  lower  bound  of  j*(  T  4-  1)  and  X(D  is  monotonically  increasing  in  T, 
i.e., 

(10)  j* (  T  +  1)  Js  X(D  5*  X(T  -  1). 

PROOF:  The  proof  easily  follows  from  (8)  and  (9). 

Regeneration  Set: 

The  regeneration  set  of  the  C(D -problem  can  now  be  defined  as 

(11)  R(T)  -  sei  -  {k(D,  X(r)  +  1 . T-l). 

We  note  that  a  set  is  known  as  an  R(T)- set  if  it  contains  at  least  one  /{-point  of  an  optimal 
solution  to  any  problem  with  horizon  T or  longer. 

In  many  cases,  it  is  possible  to  reduce  the  size  of  the  /{(D-set.  This  reduction  makes  it 
easier  to  obtain  planning  horizons  as  can  be  seen  in  the  planning  horizon  theorem  below.  The 
reduction  procedure  requires  compulations  of  the  5,'(D-sets  for  all  /;,  where  an  Sh(T)- set  is 
defined  to  be  a  set  of  regeneration  points  such  that  j*h(  T  +  I)  €  5'’(  D- set  for  all  periods 
( T  +  I)  with  ./*'’(  T  +  I)  <  T  and  /  >  0.  It  is  convenient  to  express  this  procedure  by  the  flow 
chart  on  the  next  page. 

The  reduced  R  (  D-set  can  be  obtained  as  follows: 

(12)  R  (  D-set  =  U  S'TD-set. 

h 

It  should  be  noted  that  the  /?( D-set  in  (12)  cannot  be  bigger  than  the  /{(D-set  in  (11).  We 
now  prove  the  following  important  result  for  the  set  5  found  in  the  flow  chart. 

THEOREM  3:  If  a  period  r  S'  the  5-set  found  in  the  flow  chart  for  technology  h.  then 
./**(  T  +  /)  ^  i  for  any  value  of  /  >  0. 

PROOF:  If  i  9  the  5-set  found  in  the  above  flow  chart,  then  there  is  a  period  u, 
T  -  I  >  u  >  /,  such  that  Cj?(  D  ^  C,(  T).  From  the  improving  technology  assumption,  we 
also  have  0//+  u  <  0,+  j  *  for  k  >  T  +  1.  We  can  now  write  for  any  period  T  +  /: 

T+l 

CZ(T  +  /)  -  C*(D  +  1  0*+ 1.* 

k-T+ 1 
T+l 

<  C,*(D  +  I  OX  I,* 

*-r+i 

-  C,h(T+  I ), 

implying  that  jmh(T  +  I)  t*  t.  This  completes  the  proof. 
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Flow  Chari  for  Computing  the  Sh(  D-sel. 


Planning  Horizon  Theorem: 

Let  /*(/•)  denote  the  period  when  the  first  machine  is  salvaged  in  an  optimal  (re¬ 
solution.  We  now  state  the  planning  horizon  theorem  which  gives  us  a  stopping  rule  for  the 
forward  algorithm  to  find  the  optimal  replacement  time  in  an  infinite  horizon  problem. 

THEOREM  4  (Planning  Horizon  Theorem):  If  ./'* ( r )  =  t*  =  constant  for  all  r  €  R(T)- 
set  (12)  for  some  T  such  that  ./*(  T)  ^  0.  then  the  salvage  lime  of  the  first  machine  in  an 
optimal  infinite  horizon  solution  is  r*.  Furthermore,  the  optimal  technology  It*  is  determined 
by  the  condition 

(13)  C,*1  Of)  =  C’„Of>. 


The  first  replacement  time  /*  is  called  the  planning  horizon  and  T  is  called  the  forecast 
horizon.  We  need  only  to  forecast  the  relevant  technology  up  to  period  T  to  find  the  optimal 
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technology  h\  and  the  optimal  replacement  time  /*  for  the  first  machine.  The  reader  is  referred 
to  [2]  for  a  discussion  of  such  theorems. 

We  now  present  a  flow  chart  for  the  complete  forward  algorithm  along  with  the  planning 
horizon  procedure.  We  will  use  this  flow  chart  to  solve  a  numerical  example  in  the  next  sec¬ 
tion. 


Initialize 
T  -  1 

CJ  ( 1 )  —  Oft ,  y**(  1 )  -  0  for  all  h 
C'0a>  -  min  Cg (1  > 

All)  -y*(l)  -  0,  /*(!)  -  1 


r  —  t  +  i 


Find  Cf{T)  from  (6)  for 
K(T-  1)  <  j  <  r-i 


>•*(  T)  -  the  largest  y  that 
minimizes  C*(  T) 
A  (79  -  min  (J'HT)) 


Find  the  S*(7')- set 
and  the  R(T) -set 


Find  C,(D  -  min  C*(T) 
for  ally  6  <K(T),  T-  1> 


Compute  y'*(  79 


9*  (79  -  0? 


f*(r),  r  €  /?( 79-set* 
“  constant?  / 


STOP 

YES  /*(79  is  the 

- *■  planning 

horizon 


j  T  if  y*  ( 79  *■  0 
r(T)  “  1  /*ly*(79]  if 7* < D  > 


Flow  Chart  for  Implementing  the  Planning  Horizon  Procedure 
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5.  NUMERICAL  EXAMPLE 

We  now  illustrate  the  planning  horizon  procedure  by  solving  a  numerical  example.  This 
example  assumes  that  machines  of  two  different  technologies  are  available  for  replacement  at 
any  time.  The  operating  costs  O, J  and  O2  are  as  shown  below: 


j  j 


160  120  130  140  150  160 

■ 

300  60  70  80  90  1 00 ' 

160  120  150  140  150 

1 

300  60  70  80  90 

140  105  115  125 

200  50  60  70 

140  105  115 

I 

400  30  40 

120  80 

1 

400  30 

120 

■ 

200 1 

Note  that  technology  I  can  be  interpreted  as  a  labor  intensive  technology  and  technology 
2  can  be  interpreted  as  a  capital  intensive  technology.  The  calculations  of  C,](T),  C2(T),  and 
C,(T)  are  shown  below: 


c/m _  _ c/m 


T 

j'HT) 

1  2 

0  0 

J 

3 

0 

4  5 

2  4 

6 

5 

T 

j*2(T) 

1 

0 

2 

0 

3 

0 

4  5  6 

0  2  2 

0 

160  280 

410 

550  700 

0 

300 

360 

430 

510  600 

1 

320 

440 

570  710 

1 

460 

520 

590  670 

2 

420 

525  640 

765 

j 

2 

480 

530  590  660 

3 

550  655 

770 

3 

810  840  870 

4 

630 

710 

4 

910  940 

5 

710 

5 

790 

c,m 


\mr 

J*(T) 

rm 

1 

0 

0 

1 

2 

0 

0 

2 

3 

0 

0 

3 

4 

0 

0 

4 

5 

2 

2 

2 

6 

2 

2 

2 

0 

160 

280 

410 

510 

600 

i 

320 

440 

570 

670 

2 

420 

525 

590 

660 

3 

550 

655 

770 

4 

630 

710 

5 

710 

Sample  calculations  for  T  =  3  are  shown  below: 

C/  (3)  -  Of,  +  Of2  +  Of,  =  160  +  120  +  130  -  410, 
C,'(3)  =  C(l)  +  Oj 2  +  =  160  +  160  +  120  =  440, 

C\  (3)  =  C(2)  +  =  280  +  140  =  420, 

j*'(  3)  =  0. 
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C,v  (3)  =  Or,  +  Of?  +  Of,  =  300  +  60  +  70  =  430. 

Cf  (3)  =  C  M  )  +  Or2  +  0}\  =•  160  +  300  +  60  =  520, 

Or  (3)  =  C<2)  +  Of,  =  280  +  200  =  480, 

y*’<3)  =  0,  x (3)  =  mini./*1  (3).  ,/'*2(3)}  =  0. 

0„(3)  =  mint C',}  (3).  C, r  <3>)  =  410. 

C,(3)  =  mintO,1  (3),  <T(2  (3) >  =  440, 

C\(3)  =  min (C’J  (3).  <T?  (3) )  =  420. 

,/*(3)  =  0,  /* ( 3 )  =  3. 

The  planning  horizon  theorem  is  not  satisfied  for  7"  ^  5.  For  example,  for  T  —  5,  we 

have  S1  (5)  =  (4),  S' (5)  =  (2,3,41,  and  R( 5)  =  {2.3.4}.  Since  /*(2)  *  /*(3>  ^  /*(5).  the 

planning  horizon  theorem  is  not  satisfied.  For  T  —  6,  we  have  s'(6)  =  (5),  S2( 6)  =  { 2 . 5 } .  and 

R( 5)  =  { 2 , 5 } .  / * ( 2 )  =  /*( 5)  =  2,  so  the  planning  horizon  is  2  periods  and  the  forecast  hor¬ 

izon  is  6  periods. 

6.  RELAXING  THE  IMPROVING  TECHNOLOGY  ASSUMPTION 

Let  till)  'Z  1  be  a  number  associated  with  technology  h  such  that 
Of.,J+k  ^  OfJ+k  for  k  >  ,(h). 

This  condition  is  likely  to  be  satisfied  by  most  improving  technologies.  The  following 
result  holds  for  this  assumption: 

THEOREM  5:  Let  /**(  T)  <  T  -  rdi)  (0  if  T  rUi))  be  the  latest  period  such  that 
C,*/t,?l  <  e,(T)  for  all  ./'  €  <0,  T  -  r(h)>,  then  ./**(  T  +  1)  5*  /*'’(  70 

PROOF:  Assume  to  the  contrary  that  y**(  T  +  I )  —  a  <  i*h(T )  It  is  easy  to  see  that 

00\,.r.i  5s  Of  */i(D  +  1.  T+  I  and  Cf(T)  >  Cf  *h,r,(T I.  We  can  now  write 

Cf  *h,  Tl(T  +  1)  -  Cf  * A,r,<r )  +  Of  •  Aimi.w 

<  C*(D  +  Oa\,.r.i 

=  c*(r+  i). 

implying  that  j*h(T  +  1)  ^  i*h(T). 

Regeneration  Set. 

Let  k(T)  ■=  min  [/**( 7”>  1 ,  then  the  regeneration  set  of  the  C( D -problem  can  be  defined 

h 

as 

R(T)~  set  Hllfum  +  l . r-  1). 

As  in  Section  4,  it  is  possible  to  find  a  reduced  regeneration  set.  For  this  we  first  find  the 
Sh(  73- set  as  below.  The  reduced  regeneration  set  can  be  found  by  using  (13). 
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Flow  Chart  for  Computing  the  S'(  T)- set 


7.  EXTENSIONS  AND  CONCLUDING  REMARKS 

In  this  paper  we  have  solved  the  infinite  horizon  replacement  model  with  multiple  alterna¬ 
tives.  We  have  obtained  planning  and  forecast  horizons  for  this  model  under  reasonably  gen¬ 
eral  conditions.  Furthermore,  the  model  can  be  easily  adapted  to  situations  where  the  set  of 
alternative  technologies  is  changing  over  time.  This  is  done  by  setting  <?/',  =  =»  for  the  ima- 
i  ginary  zf/'  machine  for  all  /  <  i  for  a  technology  which  appears  at  lime  i  for  the  first  time.  If  a 

*  technology  h  disappears  in  period  t.  then  we  let  O1',  =  °°  for  fictitious  machines  A1’,  for  all 

i  /  >  i  With  these  definitions,  the  model  developed  in  the  paper  is  applicable  to  the  situation  of 

changing  sets  of  technologies 

Another  situation  to  which  the  model  can  be  easily  adapted  is  the  situation  in  which  a 

switch  over  cost  kah  occurs  whenever  a  machine  of  technology  a  is  replaced  with  a  machine  of 

technology  b.  For  this,  we  do  require  the  assumption  that  A"'’  is  separable,  i.e.. 

k“h  =  k“  +  kh. 

In  this  case,  we  can  adjust  the  salvage  value  of  the  existing  machine  downward  b\  amount  k" 
and  adjust  the  price  of  the  new  machine  upward  bs  amount  kh. 

Finally,  we  must  slate  that  the  extensions  of  (he  machine  replnicmcni  model  m  "i>’  r.  > 
oux  paper  [  5 1  can  also  be  solved  in  the  multiple  altemaiixe  case  m  a  tan  U  ai.i  girt  ■« 
manner 
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ABSTRACT 

This  paper  deals  with  flowshop/sum  of  completion  times  scheduling  prob¬ 
lems,  working  under  a  "no-idle"  or  a  "no-wait"  constraint,  the  former  prescribes 
for  the  machines  to  work  continuously  without  idle  intervals  and  the  latter  for 
the  jobs  to  be  processed  continuously  without  waiting  times  between  consecu¬ 
tive  machines.  Under  either  of  the  constraints  the  problem  is  unary  NP- 
Complete  for  two  machines.  We  prove  some  properties  of  the  optimal 
schedule  for  n/ V F,  no-idle/IC’,.  For  n/m/P,  no-idle/X  C,  and  n/m/P,  no- 
wait /EC,  with  an  increasing  or  decreasing  series  of  dominating  machines,  we 
prove  theorems  that  are  the  basis  for  polynomial  bounded  algorithms.  All 
theorems  are  demonstrated  numerically. 


INTRODUCTION 

The  nonpreemptive,  flowshop,  sum  of  completion-times  scheduling  problem  is:  n  jobs 

Ui.J2 . J„)  have  to  be  processed  by  m  machines  (Mi,Af2,  Mm).  Job  7,, 

i  “  1,2 . n,  consists  of,  at  most,  m  operations  (0,i,0/2,  ....  0,m).  Operation  0,7  which 

precedes  0(J+1,  has  to  be  processed  uninterrupted  for  ty  time  units,  on  Mr  j  -  1,2 . m.  ty 

is  a  nonnegative  integer  —  ty  *  0  if  0 is  missing  and  positive  if  it  exists.*  Two  operations  of 
the  same  job  cannot  be  processed  simultaneously  and  a  machine  may  process  at  most  one  job  at 
a  time.  Find  the  operation  sequence  on  each  machine,  that  obeys  the  problem  constraints  and 
minimizes  the  sum  of  completeion  times.  The  problem  is  designated  «/ m/F/LC,,+  where  F 
stands  for  flowshop  discipline  and  C,  is  the  completion  time  of  job  J,. 

The  no-idle  constraint:  machine  Mk,  k—  1,2,  ... ,  m,  works  continuously  without  idle 
intervals. 

The  no-wait  constraint:  job  Jh  /  —  1,2,  ....  n  is  processed  continuously  without  waiting  times 
between  consecutive  machines. 

Both  constraints  arise  in  real  life  situations.  Examples  of  such  scheduling  are:  (i)  under  a 
no-idle  constraint  —  use  of  very  expensive  equipment  (.e.g.,  a  computer  and  its  peripheral  dev¬ 
ices)  with  the  fee  determined  by  the  actual  time  consumption;  (ii)  under  a  no-wait  constraint 
—  in  metal-processing  industries  (e.g.,  hot  rolling)  where  delays  between  operations  interfere 
with  the  technological  process  (e.g.,  cooling  in  the  above  case). 

"For  further  elaboration  of  the  meanings  and  influence  of  zero  processing  times  see  Hefetz  and  Adiri  [4], 
tFor  notation  and  classification  of  scheduling  problems  we  follow  Lenstra  [$)  and  Rinnooy-Kan  [61. 
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While  Che  no-waic  case  has  been  widely  discussed  in  the  professional  literature,  fl)  (61, 
the  no-idle  case  is  defined  here  for  the  first  time.  This  paper  deals  with  flowshop  disciplines 
with  a  no-idle  or  a  no-wait  constraint  where  the  objective  function  being  minimum  sum  of 
completion  times. 

Garey,  Johnson  and  Sethi  [3]  proved  that  n/2/F/ZC,  is  unary  NP-Complete  by  construct¬ 
ing  an  instance  that  is  no-idle  and  no-wait,  and  thus  proved  at  the  same  time  that  of  n/2/F, 
no-idle/XC,  and  n/2/  F,  no-wait/LC,.  Section  2  deals  with  some  properties  of  the  optimal 
schedule  for  n/2/F,  no-idle/XC,.  Sections  3  and  4  are  devoted  to  proofs  of  theorems  that 
underlie  polynomial  bounded  algorithms  for  n/m/P ,  no-idle/XC,  and  n/m/P,  no-wait/XC,  with 
an  increasing  or  decreasing  series  of  dominating  machines,  respectively. 

1.  COMPLEXITY  OF  FLOWSHOP/NO-IDLE  OR  NO-WAIT/SUM  OF 

COMPLETION  TIMES  PROBLEMS 

Garey,  Johnson  and  Sethi  (3]  proved  that  3-partition  is  reducible  to  n/2/F/t.C,.  How¬ 
ever,  as  the  constructed  instance  of  n/2/F/XC,  happens  to  be  no-idle  and  no-wait,  thus  at  the 
same  time  proved  the  unary  NP-Completeness  of  n/2/F/ZCh  n/2/F,  no-idle/XC,  and  n/2/F, 
no-wait /EC,.  Moreover,  only  minor  modification  of  the  proof  is  needed  for  proving  the  NP- 
Completeness  of  the  first  two  problems  where  missing  operations  are  prohibited.  Specifically, 
replacement  of  zero  processing  times  on  M\  (missing  operations)  by  c  >  0  (existing  operations 
with  infinitesimal  processing  times)  and  shifting  of  all  e  to  the  beginning  of  the  schedule  on 
A/],  proves  the  unary  NP-Completeness  of  n/2/ F,  t:]  >  0/XC,  and  n/2/ F,  tu  >  0,  no-idle/XC,. 
The  complexity  of  n/m/F,  >  0,  no-wait/XC,  for  fixed  m  >  2  is  an  open  problem. 

2.  CONSTRUCTION  OF  A  NO-IDLE  SCHEDULE 

We  distinguish  two  ways  of  constructing  a  no-idle  schedule.  Both  are  implemented  con¬ 
secutively  on  the  machines,  starting  with  the  second,  proceeding  to  the  third  and  so  on  until 
the  last  machine. 

(i)  Right  shifts.  We  shift  to  the  right  every  operation  that  precedes  an  idle  interval 
(maintaining  the  constraints  of  the  problem  on  Mk,  k  >  2,  further  right  shifts  might  be 
needed),  until  all  idle  intervals  have  been  eliminated. 

(ii)  Left  shifts.  At  first  we  schedule  the  operations  on  Mk,  k  >  2  in  a  single  stretch  (a 
block)  without  idle  intervals,  starting  when  the  last  job  on  Mk-\  has  been  completed.  After¬ 
wards  we  apply  a  maximum  left  shift  (without  violating  the  constraints)  to  the  whole  block. 

Let  us  define  a  blocking  job  on  A/*,  k  >  2,  in  a  no-idle  schedule  as  the  first  job  on  A/* 
that  prohibits  shifting  to  the  left. 

3.  PROPERTI1ES  OF  THE  OPTIMAL  SCHEDULE  FOR  n/VF,  NO-IDLE/XC, 

Conway,  Maxwell  and  Miller  12]  fundamental  theorem  for  flowshop  scheduling  that  states 
that  an  optimal  schedule  exists  for  a  n/m/F/b  problem  (8— any  regular  measure  of  perfor¬ 
mance)  with  the  same  processing  order  on  the  first  two  machines  [2,  p.  81],  holds  for  the  case 
under  discussion.  Thus,  the  set  of  permutation  schedules  for  n/2/F,  no-idle/XC/  is  a  dominant 
one. 
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THEOREM  1:  If  in  the  optimal  schedule  for  n/2/F,  no-idle/IC,,  the  blocking  job,  is  the 
last  one,  then  the  optimal  schedule  is  according  to  SPT  (Shortest  Processing  Time)  on  M2, 
except  for  the  blocking  job  that  is  the  one  with  the  minimum  processing  time  on  A/2. 

PROOF:  Let  us  denote,  a,  -  /(|l  6,  -  /(2;  S,k  starting  time  of  J,  on  A/*;  and  the  square 
brackets  indicating  the  place  of  a  job  in  the  sequence,  for  example,  Sin2— starting  time  of  the 
first  job  on  A/2.  Since  the  blocking  job  is  the  last  one,  we  have  (see  Figure  1). 

M  %2  "  £  *(/)  _  £  *I<1  “  K  +  *(«!’ 

<-i  i- 1 

where 


n  n 

K  -  £  a,  —  £  A,  -  Constant, 
/it  /«T 


Figure  I  Two  machines  no-idle  schedule  where  the 
blocking  job  is  the  last. 


The  sum  of  completion  times  takes  the  form 

(2)  £  Ci  -  S[|12  +  A(l)  +  S(n2  +  6|n  +  A(2 1  +  . . .  +  S[|)2  +  *[,i 

-  nS(il2  +£(»-'+  •)<>(/). 

/—I 

Substitution  of  (1)  yields 

(3)  +  nb\„\  +  V  (n  —  i  4-  1)A[/| 

i-T 

a- 1 

■*  nK  +  («  +  1 ) 6|n|  +  £  (ft  —  i  +  l)A|jj. 

To  minimize  (3)  A),)  should  be  the  smallest  and  Ajq,  /  —  1.2 . n  —  1,  a  nondecreasing 

sequence,  thus  the  optimal  schedule  is  given  by  the  sequence 

*l«l  <  Am  <  A(2]  <  ...  <  A|,_i|.  □ 

THEOREM  2:  The  jobs  that  (i)  precede  (ii)  succeed  the  blocking  job  in  the  optimal 
schedule  for  n/2/F,  no-idle/LC,  are  ordered  according  to  SPT  on  Af2,  provided  the  no-idle  con¬ 
straint  is  not  violated. 

PROOF:  Let  us  assume  that  the  blocking  job  in  the  optimal  schedule  is  the  «r-th  in  the 
sequence. 
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Figure  2.  Optimal  schedule  for  n/2/F,  no-idle/IC,  where  the  blocking  job  is  J|,|. 


W  5U12  “  X  alil  ~  £  *!*>• 

i-1  “ 

The  situation  here  is  rather  simitar  to  that  of  Theorem  1  with  it  replacing  n.  Thus,  in  the 
optimal  schedule  we  have  fyu  <  6(21  <  . . .  <  *[»-n- 

(ii)  For  /'  -  it  +  1,  it  +  2,  ,  n,  we  have 

CU]  "  Clirl  +  X  *W- 
y*ir+ 1 

Thus, 

X  Cl<j  "  -  w)Clir]  +  X  <n  ~  '  + 

/“ir+ 1  /— ir+1 

We  have  that  in  the  optimal  schedule  the  jobs  that  succeed  the  blocking  job  obey  the  SPT  rule 
on  M2 ,  provided  the  no-idle  constraint  is  not  violated,  6|,+|]  <  *[*+21  <  . . .  <  b\„\.  □ 

4.  n/mlP,  NO-IDLE/IC,,  m>  2,  WITH  A  SERIES  OF  DOMINATING  MACHINES 

Machine  Mk  dominates  M,  if 

(4)  min  tik  >  max 

/  / 

In  abbreviated  notation 

Mk  >  M,. 

A  scheduling  problem  with  a  series  of  increasing  [decreasing]  dominating  machines  is  one 
where  M\  <  M2  <  •  •  •  <  A/m[  M\  >  M2>  ...>  M„\. 

For  n/m/F,  no-idle/IC,,  m  >  2,  the  set  of  all  permutation  schedules  is  not  a  dominate 
set.  However,  for  the  sake  of  solvability  (development  of  polynomial  bounded  algorithms  for 
special  cases)  we  confine  our  search  for  optimality  to  permutation  schedules,  and  the  problem  is 
designated  n/m/P,  no-idle/IC,. 

Note  that  for  m  -  2  the  set  of  permutation  schedules  is  a  dominate  set. 

Let  y,k  be  the  processing  time  of  the  Mh  job  on  Mk  where  the  order  is  according  to  SPT 
on  the  last  machine,  Mm. 
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THEOREM  3:  The  optimal  permutation  schedule  for  n/m/P,  no-idle/IC,  with  an 
increasing  series  of  dominating  machines*  is  according  to  SPT  on  the  last  machine,  except  for 
the  first  job  that  satisfies 


Min 

i 


m- 1  J—  I 

n  Z  +  O'  -  Dyym  -  I 

*-i  /- T 


PROOF:  For  this  case  we  have 


(5)  ct  -  nS(i]m  +  Jj  (n  -  i  +  l)/|,|m, 

where  is  the  starting  time  of  the  first  job  on  the  last  machine,  and 

m—  1 

W)  S[ilm  -  f|»i. 

Thus,  for  a  given  first  job  the  optimal  permutation  schedule  is  SPT  on  Mm , 

'(2)m  <  '(31m  <  •••  <  'Ulm- 

Selecting  the  >th  job  in  the  SPT  sequence  on  Mm  to  be  the  first,  we  have 

O)  Z  C‘  ”  "  Z  yJI  +  0  ~  1  hjm  -  Z  y>«  +  5^  (w  -  '  +  1  him- 

Since  the  last  term  in  (7)  is  not  affected  by  the  choice  of  the  first  job,  the  latter  is  taken  so  as 
to  satisfy 

m-l  j-l  1 

“  «  Z  +  O'  “  1) Vjm  -  Z  T™}-  D 

*-l  i-l 


EXAMPLE  1:  A  5/4/ P,  no-idle/IC,  with  an  increasing  series  of  dominating  machines 
with  processing  times  as  per  Table  1 . 


TABLE  1  —  Processing  Times 


Jobs 

Machines'^ 

h 

h 

h 

'a 

Js 

My 

3 

P 

3 

— r 

M2 

4 

5 

6 

4 

6 

Mi 

7 

9 

8 

8 

9 

Mt, 

_ LL_ 

13 

14 

10 

12 

A f4>  Af)>  M2  >  Mi,  thus  we  have  an  increasing  series  of  dominating  machines. 
SPT  sequence  on  A/4  is  4-1-5-2-3. 


'Since  Mt+I  >  Mk,  k  -  1,2 . m  -  1,  the  no-idle  constraint  is  not  effective  and  for  the  case  under  discussion,  the 

two  problems  n/m/P,  no-idle/IC,  and  n/m/P/XC,  are  equivalent. 
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The  first  job,  say  J„  saiisfies  Vt  —  Min  Vj,  is  Ihe  t-ih  in  the  SPT  sequence  on 

j 

V,  -  5(2+4  +  8)  .  -  70 

V2  -  5(3+4  +  7)  +  11-10  -  71 

V3  -  5(2+6+9)  +  212- (11  +  12)  -88 

V4  -  5(l  +  5+9)  +  313- (10+11  +  12)  -81 

Vs  -  5(3  +  8+8)  +  414-  (10+11  +  12  +  13)  -  95 

The  job  chosen  to  be  the  first  is  the  first  in  SPT  sequence  on  Mk  (F,  -  min  Vj ),  namely  J4. 

j 

The  other  jobs  are  scheduled  according  to  SPT  on  M4  Thus,  the  optimal  permutations 
schedule  is  4-1 -5-2-3  with  C,  -  240,  and  takes  the  form  as  per  Figure  3. 

miEOMs] 


M,  I  4  I  1  I  s  |  2  |  3 


4  1 


*4 1 _ 1*1  5  2 _ I _ I _ | 

24  35  47  60  74 

Figure  3.  Optimal  schedule  for  Example  I. 

THEOREM  4:  The  optimal  permutation  schedule  for  n/m/P ,  no-idle/IC,  with  a  decreas¬ 
ing  series  of  dominating  machines  is  according  to  SPT  on  the  last  machine  except  for  the  last 
job  that  satisifies 


{m  n  ) 

Yj-  n  £yjk-  (n  -  jhjm  +  £  >,«}• 

k-2  l-j+1  I 


PROOF:  Since  M,  >  M2  >  ...  >  Mm  the  blocking  job  on  Mk,  k  —  2,3 . m,  is  the  last 

one,  we  have 

m— 1 

(8)  S[ilm  -  Jj  (5’(|](*+i)  -  Smk) 

m-1  f  »  n-1  1 


(8)  5(nm  -  Jj  (5,(ik*+i)  -  •S'nifc) 

m-l  f  a  »-l  | 

*  I  I  'l/l*  ~  X  ^K/t+l)! 

*-i  J/— T  /-i  J 

“  X  ~  ^*+1  +  '(altt+l))  **  K  +  2*  fUI(*+l)> 

*-l 

It 

where  Tk  is  the  total  processing  time  demanded  on  Mk,  Tk  -  constant  —  J?  /*  and 

K  -  r,  -  rm. 
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Substitution  of  (8)  in  (5)  yields 


X  G  “  nK  +  n  £  'uiu+u  +  X  ~  •  +  D'[,i «■ 

i-l  k- 1  /-I 


Given  the  job  in  the  last  (n-th)  place,  we  have 


X  C,  -  Af|  +  £  («  -  /  +  1)/[(| 

i-l  ” 


where  Kt  -  constant  -  nK  +  n  £  +  'l.ir 

*-i 

Minimum  of  (10)  is  obtained  by  an  SPT  sequence  on  Mm> 

'Him  <  '[21m  <  •  •  •  <  '(«-llm  ■ 

Choosing  the  >th  job  in  the  SPT  sequence  on  Mm  for  the  last  place,  we  have 

(H)  £  C, •  -  nK  +  n  £  yJ(k+ „  -  (n  -  j)y,m  +  £  yim  +£(«-/  +  l)y,m. 

/— T  /-y-f  1  /-I 

The  last  term  on  the  right  hand  side  of  (11)  is  independent  of  the  choice  for  the  last  place,  thus 
the  job  is  taken  so  as  to  minimize 


m- 1  n 

yj  -  n  X  yj(k+u  -  (n-  j)yJm  +  £  y,m  • 

k-1  l-7+l 


COROLLARY  1:  The  optimal  schedule  for  n/2/F ,  no-idle/IC,  where  M\  >  M2  is 
according  to  SPT  on  M2  except  for  the  last  job  that  is  the  one  with  minimum  processing  time 
on  M2. 

PROOF:  As  was  pointed  out  the  two  problems  n/2/P  ,  no-idle/EC,  and  n/2/F,  no- 
idle/EC,,  are  equivalent.  Substitution  of  m  »  2  in  Theorem  4  yields 

v\  ”  I  vi  "  Jy j2  +  T 

J  \  i-J+i 

Thus,  the  last  job  is  the  first  in  SPT  sequence  on  M2.  This  result  is  in  agreement  with 
Theorem  1— since  A/j  >  M2  the  blocking  job  is  the  last  one  and  should  be  the  smallest.  □ 

EXAMPLE  2:  A  5/4 /P,  no-idle/EC,  with  a  decreasing  series  of  dominating  machines 
with  processing  times  as  per  Table  2. 


TABLE  2  —  Processing  Times 


"N.  Jobs 

Machine^ 

A 

A 

A 

A 

A 

M\ 

UTT 

STTS 

"74“ 

"IF" 

"It 

m2 

7 

9 

8 

8 

9 

Ms 

4 

5 

6 

4 

6 

Mt 

3 

1 

3 

2 

2 
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M\  >  M 2  >  >  A/4,  thus  we  have  a  decreasing  series  of  dominating  machines. 

SPT  sequence  on  A/4  is  2-4-5- 1-3. 

The  last  job,  say  Jh  satisfies  Vt  =  Min  Vj,  is  the  Ath  in  the  SPT  sequence  on  A/4. 


V,  =  5(9  +  5  +  l)  -  4-1  +  2+2+3+3  =  81 

V2  =  5(8+44-2)  -  3-2  +  2  +  3  +  3  =72 

V2  =  5(9  +  6  +  2)  -  2-2  +  3  +  3  =  83 

V4  =  5(7+4  +  3)  -1-3  +  3  =70 

K5  =  5(8+6  +  3)  =  85 


The  job  chosen  to  be  the  last  is  the  fourth  in  SPT  sequence  on  M4,  ( V4  -  min  Vj),  namely  J\. 

j 

The  other  jobs  are  scheduled  according  to  SPT  on  A/4. 


s 

Thus,  the  optimal  permutation  schedule  is  2-4-5-3-1  with  ]£  Cf 


per  Figure  4. 


343  and  takes  the  form  as 


5.  it/ m/P,  NO-WAIT/ZCf,  m  >  1,  WITH  A  SERIES  OF  DOMINATING  MACHINES 

We  recall  that  "no-idle"  constraint  prescribes  for  the  machines  to  work  continuously 
without  idle  intervals,  while  the  no-wait  constraint  prescribes  for  the  jobs  to  be  processed  con¬ 
tinuously  without  waiting  times  between  consecutive  machines. 

The  set  of  all  permutation  schedules  for  an  "F,  no-wait"  problem  UtJ  >  0,  missing  operations 
are  allowed)  is  not  a  dominant  set.  However,  we  confine  our  search  for  optimality  to  permuta¬ 
tion  schedules,  namely,  the  problems  under  discussion  in  this  section,  are  particular  cases  of 
n/m/P,  no  wait/IC,.  Note  that  for  ttJ  >  0  (missing  operations  are  not  allowed)  a  feasible  solu¬ 
tion  for  an  "F,  no-wait"  problem  is  a  permutation  one,  thus  "F,  ry  >  0,  no-wait"  and  "F,  rw  >  0, 
no-wait"  are  equivalent.  As  was  previously  pointed  out,  n/2/F,  no-wait/IC,  is  NP-Complete 
while  the  compexity  of  n/m/F,  no-wait,  fy  >  0/IC;,  m  >  2,  is  an  open  problem. 
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THEOREM  5:  For  an  increasing  series  of  dominating  machines  the  two  problems  «/  m/P, 
no-wait /EC,  and  n/ m/P,  no-idle/EC,  have  the  same  optimal  sequence. 

PROOF:  For  n/m/P,  no-wait/IC,  with  an  increasing  series  of  dominating  machines,  we 

have 

Q.l  “  £  '(Hi  +  £ 

/-I  J- 1 

It  io  readily  shown  that  (12)  leads  to  (5)  and  (6).  □ 


A  direct  consequence  of  Theorems  5  and  3  is  that  the  optimal  schedule  for  n/ m / P,  no- 
wait/I C:  with  an  increasing  series  of  dominating  machines  is  SPT  on  Mm  except  for  the  first 
job  that  satisfies 


Min 

J 


Vj 


m- 1  j- 1 

«  £  vjk  +  O'  -  i)y>m  -  £  yim 


k- 1 


/-I 


EXAMPLE  3:  A  5/4/P,  no-wait/EC,  with  an  increasing  series  of  dominating  machines 
with  processing  times  as  per  Table  1  (Example  1). 


The  calculations  and  the  optimal  sequence  are  the  same  as  in  Example  1 ,  but  the  resulting 
schedule  is  as  per  Figure  5  (compare  with  Figure  3). 

THEOREM  6:  The  optimal  schedule  for  n/m/P,  no-wait/EC„  m  >  2,  with  a  decreasing 
series  of  dominating  machines*  is  according  to  SPT  on  M\. 


’Since  Mk  >  ,,  *-1,2 . m  -  1,  the  no-wait  constraint  is  not  effective  ;  for  the  esse  :r  discussion  the 

two  problems  n/m/P,  no-wait/IC,  and  n/m/P/ZC,  are  equivalent. 
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I  AIJIRI  AND  D.  POIIORVLES 


PROOF:  Since  Mk>  Mk+\,  k  -  l ,  2 . m  -  1,  we  have  for  the  optimal  schedule 

(13)  Z  C'  “  Z  'Hi;  +  'mi  +  Z  'Hi;  +  •  •  •  +  Z  Hu i  +  Z  'i»i; 

i-l  ;-l  i-l  1 

“  z  Z  'HI;  +  Z  ~  'hli ■ 

1  /-! 

The  first  term  on  the  right-hand  side  of  (13)  is  a  constant,  and  the  sequence 

'(ill  ^  'uii  <  •••  ^  'u-ui  minimizes  the  second  term.  Morever,  rInn  >  t[„-i)i,  otherwise  if 

/(„ 1 1  ^  max  Z(iH  the  value  of  the  second  term  can  be  reduced  by  interchanging  the  last  job  with 

I 

J\j\\‘  'OH  “  m?x  '(,J|-  a 

EXAMPLE  4:  A  5/4/ P,  no-wait/I C,  with  a  decreasing  series  of  dominating  machines 
with  processing  times  as  per  Table  2.  Since  Mt  >  M2>  My>  Af4  we  have  a  decreasing  series 
of  dominating  machines  and  the  optimal  sequence  is  according  to  SPT  on  Mt,  4- J -5-2-3,  with 

TC|«  247.  The  optimal  schedule  takes  the  form  as  per  Figure  6. 
i-t 


M, 


M- 


M* 


i  m 


el i  m  m  m _ de 


mm  rr 


m 


-El _ EU 

24  35 


JH _ t 

50  61 


m 


n 


Figure  6.  Optimal  schedule  for  Example  4. 
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ON  THREE  BASIC  METHODS  FOR  SOLVING 
BOTTLENECK  TRANSPORTATION  PROBLEMS 
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ABSTRACT 

For  solving  transportation  problems  essentially  three  types  of  methods  are 
known:  primal  methods,  the  Hungarian  method  and  the  shortest  augmenting 
path  method  In  this  paper  we  present  the  specialization  of  these  approaches  to 
the  bottleneck  transportation  problem  and  report  some  computational  experi¬ 
ence. 


\ 

► 

t 


1.  INTRODUCTION 

The  classical  transportation  problem  (TP)  can  be  formulated  in  the  following  way.  There 
are  m  supply  points  and  n  demand  points  with  supply  point  i  capable  of  supplying  amount 

a,(/  -  1 . m)  and  demand  point  j  having  demand  bj(J  =  1 . n)  with  £  a,  -  £  bj. 

<  j 

Find  the  least  cost  transportation  pattern  from  the  supply  points  to  the  demand  points  when  the 
cost  of  transporting  a  unit  amount  from  /'to  j  is  c,j ,  i.e., 


m  n 


min  Hc(/'  xu 

/- 1  >-i 

subject  to 

(1.1) 

£  xu  - a' 

(/  -  I . m) 

7-1 

(1.2) 

I**-*, 

i-i 

0-1 . n) 

(1.3) 

W 

o 

(/'  -  1 . m\ 

A  related  problem  is  the  bottleneck  transportation  problem  (BTP).  Here  a  transportation 
time  is  specified  between  each  supply  point  i  and  each  demand  point  j.  Now  it  is  required  to 
find  a  transportation  pattern  which  minimizes  the  total  time  necessary  for  transporting  the 
goods  from  the  supply  to  the  demand  points.  Thus, 

(1.4)  min  max  f/yUy  >  0|  subject  to  (1.1),  (1.2)  and  (1.3) 

Problem  (1.4)  is  often  called  "time-transportation  problem."  It  occurs  in  connection  with  tran¬ 
sportation  of  perishable  goods,  with  the  delivery  of  emergency  supplies  or  when  military  units 
are  to  be  sent  from  their  bases  to  the  front. 


«■ 
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For  solving  TP  we  know  essentially  three  types  of  methods 

—  primal  methods 

—  the  "Hungarian”  method 

—  the  shortest  augmenting  path  method. 

Nearly  every  approach  for  solving  TP  can  be  modified  to  treat  BTP. 

So  for  instance  the  primal  method  of  Hammer  (10],  [10al,  for  solving  BTP  is  strongly 
related  to  the  method  of  Klein  [11]  for  solving  TP.  Starting  from  a  feasible  solution  both 
methods  are  looking  for  "negative  cycles"  to  improve  the  actual  solution. 

In  this  paper  we  present  these  three  basic  approaches  for  solving  BTP  and  we  report  some 
computational  experience  with  these  methods. 

2.  THE  HUNGARIAN  METHOD 

This  method  for  solving  BTP  was  first  proposed  by  Garfinkel  and  Rao  [8].  Inspired  by  a 
work  of  Edmonds  and  Fulkerson  (51  on  general  bottleneck  problems,  they  called  the  procedure 
the  Threshold  Method.  The  algorithm  can  be  described  in  the  following  way:  First  determine  a 
"good"  lower  bound  z  for  the  optimal  objective  value  z.  Then  define  a  network  rfiz)  in  the  fol¬ 
lowing  way:  Let  V  —  (s,/}0  M  0  N  with  Af  =*{1,2 . m }  and  N  —  [m  +  1 . m  +  n}. 

be  the  nodeset  of  X(z).  Now  every  node  /'  €  M  is  connected  with  s  by  an  arc,  respectively, 
each  m  +  j  €  N  with  t.  A  pair  (i,m  +  j)  is  connected  by  an  arc  only  if  <  z  holds. 

The  arc-capacities  dk  i  are  defined  by 

dsi  :-  a(  (i  —  1 . m) 

d,.m+J  °°  (/-l . m\j  =  1 . n ) 

dm+j.t  *>j  0-1 . n). 

Now  a  maximum  flow  /  -  (ftJ)  from  source  s  to  sink  t  is  determined  using  the  labeling  method 
of  Ford  and  Fulkerson  (6],  If  for  the  maximal  flow  value  v  —  £  a,  holds,  the  optimal  solution 

/ 

for  the  BTP  is  obtained  defining 

*u  ft.m+j  0-1 . m\j  -  1 . n) 

If  v  <  I  a,  the  labeling  procedure  yields  simultaneously  a  minima!  cut  (X,X)  in  S(z) 
with  s  €  X  and  t  €  X.  This  cut  has  the  property 

0;ro  +  j)  ?  iX.X) 

To  obtain  a  solution  to  (l.D-0.3)  it  is  therefore  necessary  to  use  an  arc  (i,m  +  j)  with  ttj  >  z. 
Hence,  we  determine 

z*  :—  min  { I /  €  A',  m  +  j  €X]  >  z- 

Define  z  :-  z*  and  repeat  the  process. 

After  im  •  n)  iterations,  at  most,  an  optimal  solution  is  obtained.  The  algorithm  is  sum¬ 
marized  in  the  following  flow-chart: 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


VOL.  29,  NO.  3.  SEPTEMBER  1982 


SOLVING  BOTTLENECK  TRANSPORTATION  PROBLEMS 


507 


Figure  1.  Flow-chart  of  ihe  Hungarian  method 


During  the  algorithm  the  network  X(z)  need  not  be  constructed  explicitly.  All  operations  of 
the  labeling  procedure  can  be  performed  directly  on  the  "time"  —  matrix  T  -  (t0). 

3.  A  PRIMAL  METHOD 

The  first  method  for  solving  BTP  proposed  by  Barsov  [l]  is  a  primal  algorithm  which 
proceeds  "dual"  to  the  Hungarian  method  in  some  sense.  In  the  course  of  the  Hungarian 
method  the  "threshold"  z  is  successively  increased  until  a  feasible  solution  can  be  found.  In  this 
method  a  feasible  solution  is  always  at  hand  and  the  threshold-value  is  successively  decreased 
until  no  further  solution  can  be  found.  Starting  from  a  feasible  solution  x  -  (xiy)  with  objec¬ 
tive  value  z  a  cost-matrix  D  *•  ( dtj )  is  defined  by 

j  0  if  t,j  <  z 
d'J  "  j  1  else 

Now  a  TP  with  cost-matrix  D  is  solved.  If  the  optimal  solution  y  —  (y,j)  has  an  optimal 
value  z(y)  >  0  the  solution  x  —  (xy)  is  optimal  for  BTP,  Otherwise 

z*  max  { t,j  ly^  >  0}  <  z  holds. 


Define  1 z*  and  x  y  and  repeat  the  process. 

Computational  experience  shows  18]  that  this  method  is  inferior  to  the  Hungarian 
method.  Recently,  Finke  and  Smith  {7]  developed  an  improved  primal  method. 
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Starting  from  a  lower  bound  z  and  a  "good"  start  solution  x  -  (xu),  a  cost-matrix 
D  -  ( d,j )  is  defined  by 


z  +  1 


with  lx]  greatest  integer  <  x. 


Now  the  associated  TP  is  solved  using  the  well  known  MODI-method  and  x  —  (x</)  as 
start  solution.  If  the  optimal  solution  y  —  0^)  has  objective  value  z(y)  —  0,  then  y  is  optimal 

for  BTP.  Otherwise,  we  consider  the  dual  variables  «,•,/—  1,  ....  m  and  vy,  j  —  1 . n 

associated  with  the  optimal  solution  y.  Then 

u,  +  v  j  ^  dtJ  i  =  1,  ....  nr,  j  -  1 . n 

y.j  >  0  -►  U, ■  +  Vj  - 
and 

z' min  |/y|«,  +  vy  >  0}  >  z  holds. 


It  can  be  shown  that  z'  is  a  lower  bound  for  the  optimal  value  for  the  BTP.  Define  z  :«  z' 
and  repeat  the  process.  In  an  implementation  the  actual  basic  solution  of  the  TP  is  stored  as  a 
tree  using  the  list  structures  proposed  by  Glover  and  Klingman  [9],  Srinivasan  and  Thompson 
[12].  Due  to  this  technique  primal  methods  are  superior  to  other  methods  in  the  case  of  TP. 

The  following  flow-chart  summarizes  the  algorithm  of  Finke  and  Smith. 


Figure  2.  Flow-chart  or  the  primal  method 
of  Finke  and  Smith 
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4.  THE  SHORTEST  AUGMENTING  PATH  METHOD 

This  method  was  proposed  by  Tomizawa  114}  for  solving  TP  and  applied  to  BTP  by  Derigs 
and  Zimmermann  [4],  A  theoretical  foundation  of  this  method  can  be  found  in  [2J.  During 
the  procedure  so-called  /c-subproblems  (Pk)  have  to  be  solved  for  1  <  k  <  m. 

(Pk)  min  max  >  0} 

n  0/  1-1.2 . k 

£  X,J  “  0  else 
y-i 

ftt 

£  x,j  <  bj  j  -  1 . n 

i-i 

Xjj  >  0  i  —  1 . m ;  j  —  1 . n. 

The  solution  for  (P{)  is  obvious.  Starting  from  an  optimal  solution  x  -  (x,j)  for  (Pk) 
with  k  —  1,  ,  m  -  1  an  optimal  solution  for  (P*+1)  is  obtained  by  means  of  augmenting 

paths.  For  this  purpose  we  partition  (1,2 . n]  —  /V,  0  N2  with 

A,  =  |/ 1  x0  -  byj  "saturated  columns.” 

Now  a  sequence  Ax  of  mutually  distinct  matrix-entries 

u  +  ij,).  Uij\),  ( i\,h ).  ow2).  ...  (w,),  (wf+i) 
is  called  an  augmenting  path  with  respect  to  x  -  (xy)  if 
x.  ,  >  0  for  q  -  1.2 . r  and 

q'Jq 

jq  €  N\  for  q  —  1,2,  . . .  ,r  and  jr+\  €  N2. 

The  capacity  of  Ax  is  defined  by 

k 

cap  (Ax)  min  {a*+,,  bJr+[  -  ]£  x,Vf+|,  «}  with 

/• T 

«  min  lx,  ,  I9  —  1,2 . r}. 

Let  us  define  x  x  ©  Ax  by 

Xjj  -  cap  (Ax)  for  (i.j)  -  (i\.j\) . U„jr) 

^  Xjj  +  cap  (Ax)  for  Uj)  €  Ax\l(i,,y,)|<j  —  1,  ....  r} 

Xjj  else 

and 

w  (Ax)  max  [t,j\xtJ  >  0}. 

Let  Dk  be  the  set  of  all  augmenting  paths  with  respect  to  x  -  (x^)  and  Ax  €  Dk  with 

(4.1)  w(Ax)  <  w(Ax)  for  Ax  €  Dk 

(4.2)  cap  (Ax)  -  a*+j 

then  x  x  $  Ax  is  an  optimal  solution  for  (P*+t). 
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An  augmenting  path  Ax  €  Dk  with  property  (4.1)  can  be  determined  using  a  modified 
Dijkstra  method.  If  Ax  does  not  fulfill  (4.2)  we  split  (Pk+\)  into  two  subproblems  (Pk+\ )  and 
(Pk+ 1 )  with  <j*+i  cap  (Ax)  and  ak+]  :-  ak+i  -  ak+l.  Thus,  (P*+I)  can  be  solved  in  a  finite 
number  of  steps. 

Computational  experience  shows  that  the  method  can  be  improved  with  the  aid  of  the  fol¬ 
lowing  starting  procedure.  First  we  determine  a  "good"  lower  bound  z  for  the  optimal  value  z 
and  a  "good"  partial  solution  x  -  (x0),  i.e. 


£  *,J  <  a, 

I 


I 

/-T 


(4.5) 

with  the  property 

(4.6) 
such  that 


Xij>  0 


,7-1. 

,i-l. 


m,  j  —  1 . n 


*ij>  0-*  tij  <  * 


£  £  xy  is  large. 

<-■  ;-i 

A  partial  solution  x  with  (4.3)  •  (4.6)  is  called  ^-feasible.  Computational  tests  have  shown 
that  it  is  not  useful  here  to  determine  a  z-feasible  solution  with  maximal  possible  sum.  Some 
simple  heuristics  will  give  solutions  which  are  within  5%  from  optimality  much  faster  and,  with 
respect  to  the  running  time  of  the  complete  procedure,  this  is  of  advantage.  The  complete  pro¬ 
cedure  is  summarized  in  the  following  flow  chart. 


Figure  3.  Flow-chart  of  the  shortest 
augmenting  path  method. 
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5.  LOWER  BOUNDS  AND  STARTING  PROCEDURES 

All  three  methods  presented  above  start  with  a  lower  bound  z  for  the  optimal  value  z. 
Computational  experience  shows  that  the  bound  proposed  by  Garfinke)  and  Rao  is  favorable. 
Row-thesholds  z,  and  column-thresholds  zJ  are  computed  with 

z,  min  max  {fy|xiy  >  0}  subject  to 

i  xu  >  a< 

j-  i 

0  ^  X/j  <  bj  for  /'  =  1 ,  . . .  ,  m 
and  zJ  defined  analogously. 

Then  z  max  {zi ,z2.  •••  .  zm,  z',z2,  ,  z")  is  a  lower  bound.  This  bound  is  easy  to 

calculate  and  yields  a  good  estimation  for  the  optimal  value  in  most  cases. 


The  primal  method  of  Finke  and  Smith  heeds  a  good  starting  solution  x  =  (x0).  They 
propose  the  following  method.  First  a  z-feasible  partial  solution  x  —  (ity)  is  constructed.  Such 
a  partial  solution  is  used  for  the  shortest  augmenting  path  method,  too.  A  z-feasible  solution 
can  be  obtained  in  the  following  way.  For  every  row  i  (column  j)  the  number  7]  ( P)  of  feasi¬ 
ble  matrix  entries  (i,j)  subject  to  ttJ  <  z  is  computed.  Now  the  row  i0  (column  j0)  with 
minimal  7)#  >  0  (7J°  >  0)  and  positive  a/g  ( bJg )  is  determined.  If  no  such  row  (column)  exists 
the  partial  solution  is  completed.  Otherwise,  determine  in  row  /0  (column  j0 )  the  feasible  entry 
('Wo)  w>th  minimal  number  7J°  (7jo)  and  positive  bJg  Then  define 


\jo  min  K  bjJ 


%  ~  %  Vo 

bjQ  •"  K  ~  \-Jo‘ 


Update  Tj  and  V  and  repeat  the  process. 


This  way  a  good  z-feasible  partial  solution  x  -  (x0)  is  determined.  If  x  —  (x,j)  fulfills 

0.1)  and  (1.2)  then  it  is  an  optimal  solution  for  BTP.  If  the  partial  solution  is  not  optimal  in 
the  second  step  of  the  starting  procedure  proposed  by  Finke  and  Smith,  the  remaining  supply  is 
distributed  using  a  north-west-corner  rule.  Finke  and  Smith  call  their  procedure  threshold- 
totals-method. 


The  following  flow-chart  summarizes  the  starting  procedures  for  the  three  methods. 
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Figure‘4.  Flow-chart  of  the  starting  procedures. 


6.  COMPUTATIONAL  RESULTS 

FORTRAN  IV  implementations  of  all  three  methods  were  tested  on  a  CDC  CYBER  76  of 
the  Computer  Center  of  the  University  of  Cologne.  We  used  the  following  codes: 

HUNGAR  -  a  version  of  the  Hungarian  method  which  Professor  Garfinkel  kindly  made 
available  to  us 

PRIMAL  -  a  version  of  the  primal  method  which  Professor  Finke  kindly  made  available 
to  us 

SAP  -  an  improved  version  of  the  BTP-code  which  is  listed  in  [41.  The  improve¬ 
ment  concerns  mainly  the  starting  procedure. 

The  HUNGAR-code  uses  the  maximal  cardinality  NL  of  expected  postive  xu's  in  any 
column  as  input  data.  The  running  time  of  the  program  is  highly  dependent  on  the  choice  of 
NL  since  the  value  and  position  of  the  x,/s  are  stored  in  NL  x  N  matrices.  Thus,  the  choice  of 
smaller  "NL"  decreases  the  running  time.  But  if  the  actual  number  exceeds  NL,  termination 
occurs.  Choosing  NL  "  M/2  all  problems  were  solved. 

We  considered  rectangular  as  well  as  quadratic  examples.  The  integer  coefficients  of  the 
time  matrix  T,  the  supply  vector  a  and  the  demand  vector  b  were  generated  by  a  machine 
independent  uniformly  distributed  pseudo  random  number  generator  in  the  interval  [1,2}I  -11. 
Afterwards  these  numbers  were  transformed  into  the  intervals  [1,6]  with 
b  -  10,  100,  1000,  10000,  231  -1.  Twenty-five  examples  were  generated  for  each  combina¬ 
tion  to  calculate  the  mean  running  time.  Numerical  experience  showed  that  the  running  time 
of  HUNGAR  and  SAP  for  solving  rectangular  problems  with  m  >  n  is  less  than  for  the 
equivalent  transposed  problem  with  m  <  n.  Therefore,  we  recommend  that  rectangular  prob¬ 
lems  always  be  solved  in  the  form  with  m  >  n. 
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The  following  table  shows  the  mean  running  time  for  quadratic  problems  with 
m  —  n  —  100. 


TABLE  l  —  Mean  Running  Time  in  CPU -Seconds  for  (100  x  100)  BTP 


a i.bj 

1-10 

1-100 

1-1000 

1-10000 

l-<231  -1) 

SAP 

.112 

.177 

.177 

.192 

.181 

t 

PRIMAL 

.106 

.124 

.124 

.133 

.126 

HUNGAR 

.226 

.400 

.383 

.478 

.504 

g 

SAP 

.107 

.120 

.144 

.171 

.150 

i 

PRIMAL 

.094 

.106 

.105 

.106 

111 

HUNGAR 

.332 

.582 

.611 

.588 

.672 

© 

SAP 

.089 

.140 

.188 

.121 

.223 

2 

PRIMAL 

.092 

.098 

.119 

.106 

.114 

- 

HUNGAR 

.333 

.439 

.548 

.603 

.713 

The  mean  running  time  of  PRIMAL  is  significantly  better  than  the  other's.  For  SAP  this 
is  simply  caused  by  a  single  "ill  conditioned"  problem  for  which  the  running  time  is  more  than 
tenfold  the  running  time  of  the  other  24  examples  of  the  group  whose  running  time  is  compar¬ 
able  to  those  of  PRIMAL. 

SAP  and  PRIMAL  are  shown  to  be  relatively  insensitive  to  the  range  of  the  parameters 
a i.bj  and  ti}  while  the  running  time  of  HUNGAR  is  more  than  doubled  when  the  range  for  tu  is 
increased  from  b  —  10  to  b  -  231  -1. 

Then  we  modified  the  problems  subject  to 

t\j  —  r,|  —  0  for  /  —  1 . m  and  j  —  1 . n  and 

a\  -  />i  -  max  {max {a,},  max  (ft,}}. 

For  these  perturbated  problems  the  starting  procedure  has  no  effect  and  thus  the  computational 
behavior  of  the  "pure"  algorithms  is  shown. 

Table  2  shows  that  PRIMAL  and  HUNGAR  are  more  dependent  on  the  quality  of  the 
starting  procedure  and  the  range  of  the  time  values  ttJ  than  SAP. 


TABLE  2  —  Mean  Running  Time  in  CPU -Seconds  for  the  Perturbated 
(40  x  40)  Problems 


a  i.bj 

1-10 

1-100 

1-1000 

1-10000 

l-(231  -1) 

© 

SAP 

.014 

.105 

.122 

.117 

.121 

1 

PRIMAL 

.015 

.118 

.369 

.446 

.448 

HUNGAR 

.016 

.269 

.608 

.720 

.686 

© 

SAP 

.014 

.153 

.196 

.202 

.169 

$ 

PRIMAL 

.014 

.133 

.389 

.483 

.504 

HUNGAR 

.026 

.436 

.790 

.922 

.988 

© 

SAP 

.014 

.161 

.179 

.217 

.164 

© 

PRIMAL 

.015 

.132 

.398 

.483 

.506 

i 

HUNGAR 

.025 

.422 

.834 

.937 

.983 
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Then  we  applied  the  algorithms  to  bottleneck  assignment  problems  i.e.,  problems  with 

a,  -  bj=  1 . 


TABLE  3  —  Mean  Running  Time  in  CPU-Seconds  for  (100  x  100) 
Bottleneck  Assignment  Problems 


'■j 

1-10 

1-100 

1-1000 

1-10000 

1  -  (231  -1) 

|  E 

SAP 

.074 

.074 

.074 

.073 

.071 

E  o 

C  £> 
oo  a 

PRIMAL 

.083 

.092 

.100 

.096 

.105 

m  >* 
c n  CL 

< 

HUNGAR 

.055 

.103 

.143 

.140 

.155 

For  assignment  problems  SAP  needs  at  most  n/2  iterations  while  PRIMAL  will  perform 
a  number  of  degenerate  pivot  operations.  Again  PRIMAL  and  SAP  show  a  more  robust  nature 
with  respect  to  the  range  of  the  time  parameters  t,j  than  HUNGAR. 

More  computational  results  can  be  found  in  [3]  and  [4]. 

7.  CONCLUSIVE  REMARKS 

In  this  paper  we  presented  three  basic  methods  for  solving  the  bottleneck  transportation 
problem.  It  is  beyond  the  scope  of  this  study  to  give  an  entire  review  of  all  the  different 
methods  which  are  available  for  solving  BTP.  Nevertheless,  we  think  that  the  presented  pro¬ 
cedures  are  representative  in  the  sense  that  the  main  approaches  for  tackling  combinatorial 
optimization  problems  are  specialized  to  the  bottleneck  transportation  problem. 

The  computational  tests  indicate  that  PRIMAL  and  SAP  outperform  HUNGAR  in  gen¬ 
eral.  The  optimal  choice  between  PRIMAL  and  SAP  is  data  dependent  and  should  be  specially 
made  for  every  problem  to  be  solved.  If  it  can  be  expected  that  the  start-heuristic  is  perform¬ 
ing  well,  PRIMAL  is  recommended.  Otherwise,  SAP  is  of  advantage. 

We  want  to  close  our  discussion  by  referring  to  another  computational  study.  Werner 
(15]  compared  versions  of  SAP  and  PRIMAL  on  special  problems  the  data  of  which  was 
defined  by  some  structured  transportation  networks  incorporating  time  tables  for  transportation 
vehicles,  waiting-times  and  different  regions  with  different  magnitudes  of  demand  and  supply. 
Werner  reports  that  the  running  time  for  PRIMAL  was  highly  dependent  on  the  distribution  of 
data  and  the  more  the  data  was  not  uniform  the  more  SAP  became  superior. 
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ABSTRACT 

The  paper  extends  the  machine  flow-shop  scheduling  problem  by  separating 
processing  time  into  setup,  processing  and  removal  times. 


In  a  recent  paper  Yoshida  and  Hitomi  [2]  extended  the  classic  two  machine  scheduling 
problem  first  introduced  by  Johnson  ll].  They  allowed  setup  times  to  be  independent  of  the 
processing  times.  This  paper  is  the  further  extension  of  the  Yoshida  and  Hitomi  model;  it 
allows  for  separation  of  processing  time  into  setup  time,  processing  time  and  removal  time  for 
each  job  on  each  machine. 

For  example,  consider  a  machine  shop  environment.  Operations  associated  with  each  job 
on  each  machine  when  the  machine/operator  is  available  could  be  summarized  as  follows: 

1 .  Setup  time  independent  of  the  unit  to  be  processed.  This  operation  consists  of  activi¬ 
ties  such  as  obtaining  the  blueprints,  procuring  the  necessary  tools,  fetching  the 
required  jigs  and  fixtures  and  setting  them  on  the  machine. 

2.  Setup  time  that  is  a  unit  dependent.  This  operation  includes  the  time  required  to  set 
the  unit  in  the  jigs  and  fixtures  and  to  adjust  the  tools  as  required. 

3.  Processing  time. 

4.  Removal  time  dependent  on  the  unit.  This  operation  includes  the  times  for  activities 
such  as  disengaging  the  tools  from  the  unit,  and  releasing  the  unit  from  the  jigs  and 
fixtures. 

5.  Removal  time  independent  of  the  unit.  This  operation  includes  activities  such  as  dis¬ 
mantling  the  jigs,  the  fixtures  and/or  tools,  inspecting/sharpening  of  the  tools, 
returning  them  to  the  central  depository,  cleaning  the  machine  and  the  adjacent  area. 

Since  activities  2,  3  and  4  are  unit  dependent,  their  times  could  be  combined  and  desig¬ 
nated  as  the  processing  time.  However,  times  for  the  activities  1  and  5  are  independent  of  the 
unit  to  be  processed. 


VOL.  29,  NO.  3,  SEPTEMBER  1982 


S17 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


$18 


»  R  SULK 


Formulation  of  the  Problem 


Let, 


P"  -  processing  time  of  job  /  on  machine  m.  This  includes  times  for  the  activities  2,  3  and 
4  described  before,  i  -  1,2,  ...  »,  m  -  1,2. 

S”  -  setup  time  independent  of  the  unit,  i.e.,  activity  1,  for  job  /on  machine  m. 

R”  —removal  time,  i.e.,  activity  5,  for  job  /  on  machine  m. 

C”  —completion  time  of  job  /  on  machine  m. 

In  addition,  in  the  mathematical  development,  it  is  assumed  that  the  jobs  are  designated 
so  that  the  job  in  the  Ah  position  of  the  processing  sequence  is  job  /.  Since  the  setup  is 
independent  of  the  unit,  if  there  exists  an  idle  time  on  machine  II,  setup  on  machine  II  can  be 
done  before  the  unit  is  available  from  machine  I.  The  unit  is  available  from  machine  I  when 
processing  on  the  unit  is  completed;  however,  the  machine  is  not  available  for  the  next  job 
until  the  removal  operation  on  the  machine  is  finished.  Figure  1  shows  these  activities  graphi¬ 
cally. 


Machine  I 


i 

i 


1 


Figure  l.  Graphical  illustration  of  two  machine  problem- 


Completion  time  for  job  i  on  machine  I  is  given  by, 

(1)  c,1  -  £  s'  +  pj  +  *1 

Completion  time  for  job  /  on  machine  II  is 

(2)  C,2  -  (C,L,  +  S/  +  P,')  +  P?  +  R ? 

if  C/1 1  +  5, 2  <  C/1 1  +  S'  +  P'  (see  Figure  1) 

or 

C,2  -  C/1 1  +  S/2  +  P}  +  R,2 
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+  S,2  >  C,L,  +  5,'  +  P,'. 


The  combination  of  (2)  and  (3)  gives 
(4)  C2  =  Max  1C,1  -  R,'  -  S,2.  C,2_,  ]  +  S2  +  P2  +  R,2. 

By  successive  application  of  (4)  using  (1),  the  total  elapsed  time  is  given  by 


(5) 


T  =  C2  =  Max 


Max 

0<  u  ^  n 


I  (J,1  -  S2  +  P,')  -  £  +  *,2  -  *,'> 

I- 1  1-1 


,  o 


+  T  ( S 2  +  P2  +  R,2). 
<-T 


The  optimal  schedule  can  be  found  by  using  Johnson’s  method  to  solve  a  two  machine  flow- 
shop  problem  where  the  processing  times  of  job  ion  machines  I  and  II  are  S,1  —  5, 2  +  P,]  and 
P2  +  R2  -  R,\  respectively. 
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1.  INTRODUCTION 

The  theory  of  two-person,  zero-sum  games  has  occupied  a  special  place  in  the  general 
theory  of  games  ever  since  its  introduction  by  von  Neumann  and  Morgenstern  [17].  They 
presented  the  theory  of  two-person,  zero-sum  games  as  a  natural  extension  of  the  theory  of 
rational  choice  under  uncertainty,  as  modeled  by  the  maximization  of  an  expected  utility  func¬ 
tion.  Their  conclusion  was  that  rational  players  should  always  play  their  maximin  strategies  in 
such  games,  and  should  regard  the  maximin  payoff  as  the  "value"  ol  the  garne  t 

Subsequent  authors  have  questioned  both  the  validity  and  generality  of  these  conclusions. 
Ellsberg  [4],  for  instance,  points  to  gaps  in  the  arguments  which  von  Neumann  and  Morgen¬ 
stern  present  in  support  of  their  conclusions,  and  raises  questions  related  to  the  interpretation 
of  the  naure  of  the  outcomes  of  the  game,  while  Aumann  and  Maschler  [1]  discuss  issues 
which  arise  in  passing  from  the  extensive  to  the  normal  form  of  a  game.  This  latter  discussion 
has  generated  a  lively  controversy  (cf.  Aumann  and  Maschler  [2),  Davis  [31,  Owen  [71,  Taylor 
[13]).  McClennen  [6]  also  examines  gaps  in  the  arguments  presented  by  von  Neumann  and 
Morgenstern,  and,  like  Aumann  and  Maschler,  concludes  that  the  prescription  that  rational 
players  should  choose  maximin  strategies  cannot  be  derived  directly  from  the  principle  that 
rational  individuals  are  utility  maximizers. 

There  is  thus  a  gap  between  the  rationality  assumptions  which  insure  that  an  individual 
evaluates  single  person  decision  problems  as  a  utility-maximizer,  and  the  assumptions  which 
insure  that  he  evaluates  two- person,  zero-sum  games  in  terms  of  the  maximin  value.  Two  dis¬ 
tinct  issues  are  involved  here,  since  the  maximin  value  is  intended  to  be  used  both  to  evaluate 
alternative  games  and  to  identify  maximin  strategies  as  "rational"  choices.  This  paper  addresses 
the  first  of  these  issues. 

One  approach  to  studying  the  maximin  value  is  the  axiomatic  approach  of  Vilkas  [15]  and 
Tijs  (141,  who  consider  arbitrary  functions  defined  on  games  and  present  axioms  which  are 


•Thij  work  was  supported  by  National  Science  Foundation  Grant  SOC75-21820  to  the  Institute  for  Mathematical  Stu¬ 
dies  in  the  Social  Sciences,  Stanford  University,  and  by  National  Science  Foundation  Grant  SOC78-09928  to  the 
University  of  Illinois. 

tStrictly  speaking,  this  conclusion  applies  only  to  games  having  saddlepoints  in  the  set  of  admissable  strategies.  Such 
games  are  sometimes  referred  to  as  determined  games. 
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uniquely  satisfied  by  the  maximin  value.  This  paper  takes  a  different,  but  related,  approach, 
and  shows  that  the  maximin  value  of  a  two-person,  zero-sum  game  can  be  interpreted  as  a 
rational  individual’s  von  Neumann-Morgenstern  utility  for  playing  the  game,  and  demonstrates 
necessary  and  sufficient  conditions  on  the  individual’s  preferences  over  games  for  this  to  be  the 
case.  That  is,  we  will  consider  conditions  which  an  individuals's  preferences  must  obey  in  addi¬ 
tion  to  those  which  insure  that  he  is  a  utility  maximizer,  in  order  for  his  utility  function  to  coin¬ 
cide  with  the  maximin  value.  This  approach  has  been  used  to  study  other  classes  of  games: 
e  g.,  cooperative  games  with  side  payments  (Roth  (81,  (91),  simple  games  (Roth  [10]),  and  bar¬ 
gaining  games  without  side  payments  (Roth  (11],  (121).  The  concluding  section  of  this  paper 
will  discuss  briefly  the  relationship  between  the  results  obtained  here  and  these  other  results. 

It  should  perhaps  be  emphasized  at  this  point  that  it  is  not  the  purpose  of  this  paper  to 
further  explore  the  mathematical  properties  of  the  maximin  value,  which  are  well  understood. 
The  questions  which  have  been  raised  in  some  of  the  papers  cited  above  concern,  rather,  the 
interpretation  which  can  be  placed  on  the  maximin  value  as  part  of  a  theory  of  rational  (i.e., 
utility-maximizing)  behavior.  The  purpose  of  this  paper  is  to  further  explore  this  latter  issue. 

2.  THE  MAXIMIN  VALUE  AS  A  VON  NEUMANN-MORGENSTERN  UTILITY 

A  two-person,  zero-sum  game  g  will  be  denoted  by  a  triple  g  «  (SuS2,u),  where  S|  and 
S2  are  arbitrary  sets,  and  u  is  a  function  such  that  u:S |  x  —  /?2,  and  such  that,  for  any 
(s,f)  €  S,  x  S2,  ut(s,t)  =  -u2(s,t).  The  interpretation  is  that  in  the  game  g,  the  player  in 
position  /  (/  =1,2)  will  have  available  a  set  of  strategies*  S„  and  will  compete  for  prizes  over 
which  his  preferences  are  represented  by  the  utility  function  u,.  His  opponent’s  preferences 
will  be  precisely  opposite  his  own.  (Note  that  in  general  it  is  not  necessary  that  the  players 
compete  for  a  single  set  of  physical  prizes.t  It  is  sufficient  that  the  results  of  the  game  be  such 
that  the  two  players  are  never  in  agreement  over  which  of  two  outcomes  is  preferable.) 

This  formulation  defines  the  game  independently  of  the  individuals  who  will  play  it.  Con¬ 
sequently,  it  is  meaningful  to  consider  the  preferences  of  some  individual  over  the  positions  in 
a  game,  or  in  different  games.  Letting  G  be  some  class  of  two-person,  zero-sum  games,  and 
letting  /V  =  (1,2)  be  the  set  of  positions,  we  will  be  considering  an  individual’s  binary  prefer¬ 
ence  relation  P  defined  on  N  x  (J,  the  set  of  positions  in  a  game.  For  i,  j  €  N  and  g ,  g'  €  (7, 
the  statement  ( i.g)P(j,g ')  should  be  read  "it  is  preferable  to  play  position  /  in  game  g  than  to 
play  position  j  in  game  g'."  We  will  interpret  P  as  a  strict  preference  relation,  and  define  an 
indifference  relation  I  and  a  weak  preference  relation  W  in  the  usual  way.*  We  will  assume  that 
the  preference  relation  is  defined  over  the  mixture  set  of  lotteries  generated  by  N  x  G,  and 
that  it  satisfies  the  rationality  conditions  necessary  for  the  existence  of  an  expected  utility  func¬ 
tion  v. 


’Typically  the  strategy  set  5,  could  be  the  set  of  all  probability  mixtures  of  some  finite  set  of  pure  strategies 
+Two  person,  zero-sum  games  are  sometimes  interpreted  as  games  in  which,  for  every  pair  of  strategy  choices,  one 
player’s  gains  are  precisely  equal  to  the  other's  losses  in  terms  of  some  physical  commodity,  such  as  money  Under 
this  interpretation,  the  game  would  only  be  zero-sum  in  the  rare  case  that  the  two  players  have  precisely  opposed 
preferences  for  lotteries  involving  money  In  our  formulation,  the  game  specifies  the  utility  payoffs  to  be  faced  by  the 
players  For  a  given  pair  of  individuals  to  play  a  specified  game,  the  monetary  rewards  to  each  would  have  to  be  ad¬ 
justed  in  terms  of  each  individual's  utility  function  for  money 

*So,  for  a.b  €  N  x  G.  alb  if  and  only  if  neither  aPb  nor  bPa.  and  aUb  if  and  only  if  either  aPb  or  alB. 
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Since  we  are  interested  in  preferences  over  a  possibly  infinite  set  of  alternatives  (e.g.,  let 
G  be  the  set  of  all  matrix  games)  it  will  be  simplest  to  refer  to  the  axiomatization  of  utility  due 
to  Herstein  and  Milnor  |5l,  rather  than  to  von  Neumann  and  Morgenstern’s  original  axiomati¬ 
zation.  Denote  by  [p(i,g)\  (1  -  p)(J,h)\  the  lottery  which  with  probability  p  has  an  individual 
play  position  i  in  game  g,  and  with  probability  1  -  p  play  position  j  in  game  h.  Then  we  assume 
that  the  preference  relation  is  defined  on  a  mixture  set  containing  all  such  lotteries,  and  that 
this  preference  relation  has  the  properties  of  transitivity,  continuity,  and  substitutability 
sufficient  for  the  existence  of  an  expected  utility  function  (Herstein  and  Milnor  [51).  That  is, 
there  exists  a  function  v  (unique  up  to  origin  and  scale)  which  preserves  preference  (v(/,g)  > 
vO',/i)  iff  (i.g)P(j-h))  and  evaluates  the  utility  of  a  lottery  by  its  expected  utility  (v[p(/,g); 
(1  -  p)(J,h)]  =  pv((/,g))  +  (1  -  p)\((j,h)). 


The  fact  that  we  are  considering  preferences  defined  over  games,  which  are  themselves 
defined  in  terms  of  an  expected  utility  function,  puts  some  additional  restrictions  on  the  allow¬ 
able  preferences.  In  particular,  for  any  real  number  k ,  let  gk  be  the  (degenerate)  game  defined 
by  gk  «  (Si,S2,«)  such  that  |S)|  -  |S2I  "  1.  and  «)(Si,S2)  -  k.  Thus  g*  is  the  game*  in  which 
each  player  has  only  one  strategy  to  choose,  which  results  in  the  player  in  position  1  receiving  a 
utility  of  k ,  and  the  player  in  position  2  receiving  a  utility  of  —  k.  Since  k  is  an  expected  utility, 
we  know  precisely  how  an  individual’s  preferences  should  behave  over  alternative  games  gk. 
r  ^mally,  we  assume  that  the  set  G  contains  the  set  of  degenerate  games,  and  impose  the  fol- 
.  /ing  conditions  on  preferences  over  degenerate  games,  which  have  the  effect  of  embedding 
the  space  of  utilities  into  the  space  of  games. 

(1)  (\.gl)P(l.go). 


This  simply  states  that  it  is  preferable  to  play  position  I  in  the  degenerate  game  g,  which 
awards  the  player  in  position  1  a  utility  of  1  than  to  play  position  1  in  the  game  g0  and  get  a 
utility  of  0. 


(2) 


For  any  real  numbers  cand  A;  such  that  c  ^  1, 


1-1 

c 

and  (l,g0) 


,g*);  y  (l,g_t) 


The  first  part  of  this  condition  states  that  a  player  is  indifferent  between  receiving  a  utility  of  k 
for  certain  in  the  degenerate  game  g*,  or  participating  in  a  lottery  which  gives  him  a  utility  of  ck 
with  probability  1/c  and  a  utility  of  0  otherwise.  The  second  part  of  the  condition  states  that  a 
player  is  indifferent  between  receiving  a  utility  of  k  or  — fc,  each  with  probability  1/2,  or  receiv¬ 
ing  a  utility  of  0  for  certain.  This  is  precisely  what  is  meant  by  the  statement  that  the  function 
u  is  an  expected  utility  function. 


Condition  (1)  permits  us  to  normalizet  the  utility  function  v  in  the  natural  way,  and  we 
henceforth  take  v(l,g0)  -  0,  and  v(l,g|)  -  1.  Condition  (2)  then  completes  the  embedding  of 
utilities  into  the  space  of  games,  giving  us  the  following  result. 


PROPOSITION  1:  For  any  real  number  k,  v(l,g*)  —  k. 


"Technically  speaking,  our  definition  allows  there  to  be  more  than  one  game  gk.  since  we  could  allow  different  one- 
element  strategy  sets.  This  difference  is  inessential,  and  can  be  ignored. 

(Recall  that  v  is  determined  only  up  to  choice  of  origin  and  scale. 
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PROOF:  Since  v  is  a  utility  function,  the  first  part  of  (2)  implies  that,  for  c  >  1, 
v(l,«*)  -  vll/c(l ,«£■*);  (1  -  (l/c))(l,g0)].  But  v  is  an  expected  utility  function,  so  the  utility 
of  a  lottery  is  its  expected  utility;  i  e.,  v{l/c(l,&*);  (1  —  (l/c))(l,£o)l  “  (l/c)v(l,&*)  + 
(1  -  (l/c))v(l,£0)  ”  (l/e)v(l,*rt).  Thus,  for  any  k ,  v(l,&*)  -  cvO.g*)  for  c  >  1.  For 
0  <  c  <  1,  let  rf-  1/c,  and  it'  -  ck.  Then  vd,^-)  «  dv(\,gk -),  so  v(l,&*)  -  cvfl.g*)  for 
any  k  and  any  c  >  0.  In  particular,  for  any  k  >  0,  vO.g*)  —  fcv(l,£i)  —  k.  But  the  second 
part  of  (2)  implies  that  v(l,&)  -  -v(l,£_t)  for  any  k ,  and  so  vd,£*)  -  k  for  all  k ,  as  was  to 
be  proved. 

So  far  we  have  imposed  conditions  on  preferences  over  degenerate  games  which  reflect 
the  fact  that  they  are  associated  with  the  underlying  expected  utility  function,  but  we  have  yet 
to  impose  any  conditions  which  reflect  the  zero-sum  nature  of  all  games  in  G.  The  following 
conditions  will  suffice  for  our  purposes. 

(3)  For  all  g  €  G,  (l,g0) 

This  condition  states  that  a  player  is  indifferent  between  getting  a  utility  of  zero  for  certain  in 
the  degenerate  game  g0  or  participating  in  a  lottery  which  gives  him  an  equal  probability  of 
playing  either  position  in  any  two-person,  zero-sum  game  g.  The  motivation  for  this  require¬ 
ment  is  the  idea  that  if  a  given  zero-sum  game  g  yields  an  advantage  to  a  player  in  one  of  the 
positions,  then  it  must  yield  a  corresponding  disadvantage  to  the  player  in  the  other  position. 
An  immediate  consequence  of  condition  (3)  is  the  following: 

PROPOSITION  2:  For  all  g  €  C,  v(U)  -  -v(2 ,g). 

PROOF.  (3)  implies  that  v(l,£0)  »  vfl/2(l,g);  l/2(2,g)l  -  (l/2)v(l,^)  +  (l/2)v(2,g). 
But  vd.£0)  -  0,  so  v(l.g)  -  -v(2„g). 

To  state  the  next  condition,  consider  a  game  g  -  (Si,S2,w)  and  a  given  position  i  €  N. 

for  every  strategy  s  €  S„  define  k(s)  -  kig.i.s )  -  inf  u,(s,t),  where  j  *  /.*  Then  in  addition 

ns, 

to  the  conditions  already  imposed  on  the  preferences,  we  require  the  following. 

(4)  For  any  g  €  G,  i  €  N,  and  s  €  S„  ( i,g )  fV(i.gk,s)). 

If  the  game  g  were  such  that  S,  contained  only  a  single  strategy,  then  condition  (4)  would  sim¬ 
ply  be  a  version  of  the  "sure  thing”  principle,  which  states  that  any  prospect  is  at  least  as  desir¬ 
able  as  the  worst  outcome  which  can  result  from  that  prospect.  As  it  stands,  the  condition  also 
reflects  that  individual  /  is  free  to  choose  any  strategy  s  in  S,.  Condition  (4)  implies  the  follow¬ 
ing  proposition. 

PROPOSITION  3:  For  any  g  €  G, 
v(l.g)  ^  sup  inf  u,(s.t ), 

s(S ,  rfS 2 

and 

v(2,g)  >  sup  inf  u2(/,s). 
siSj  f<S| 


‘So  k(g,  l.s)  -  inf  ui(s.r),  and  k(g.  2. s )  -  inf  u,(r.s). 
l(S2  KS, 


I 


y  (U);  y  <2,*> 
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That  is,  the  utility  of  playing  a  game  is  at  least  as  large  as  the  "security  level"  which  the  game 
provides.  No  further  assumptions  are  needed  to  permit  us  to  reach  the  following  conclusions. 

PROPOSITION  4:  For  any  g  €  G, 

inf  sup  u,(s,t)  >  v(l,£)  >  sup  inf  uAs.t). 

liS2  s€S,  J€S,  ies2 

PROOF:  The  right  hand  inequality  is  simply  the  first  part  of  Proposition  3,  while  the  left 
hand  inequality  follows  from  Proposition  2  and  the  second  part  of  Proposition  3,  together  with 
the  fact  that  t/,(s,t)  -  -u2(s,t). 

A  game  g  is  said  to  have  a  saddlepoint  if  there  exist  strategies  s  €  S,  and  t  €  S2  which 
achieve  the  inf  sup  and  sup  inf  of  Proposition  4,  and  make  them  equal.  For  such  games.  Pro¬ 
position  4  has  the  following  immediate  consequence. 

PROPOSITION  5:  If  g  €  G  has  a  saddlepoint,  then 

v(l,g)  -  max  min  U|(s,t)  =»  min  max  uj(s,f). 
i«s,  its j  tis2  ses, 

The  famous  Minimax  Theorem  of  von  Neumann  116]  establishes  that  if  g  is  the  mixed  exten¬ 
sion  of  a  finite  two-person,  zero-sum  matrix  game,  then  g  has  a  saddlepoint  and  Proposition  5 
applies. 

DISCUSSION 

The  previous  section  establishes  conditions  on  an  individual's  preferences  over  games 
which  yield  the  maximin  value  as  his  utility  for  a  given  game.  Insofar  as  these  conditions  are 
consistent  with  our  other  notions  about  zero-sum  games,  this  shows  that  the  maximin  value  is 
consistent,  both  formally  and  substantively,*  with  the  notion  of  rationality  embodied  in  utility 
maximization.  The  "machinery"  necessary  to  establish  this  result  shoiuld  not,  however,  be  per¬ 
mitted  to  obscure  the  essential  simplicity  of  the  argument. 

The  two  substantive  conditions  on  preferences  are  (3)  and  (4).  Condition  (4)  serves  to 
establish  a  lower  bound  on  the  utility  of  playing  the  game  from  either  position.  Condition  (3), 
by  fixing  the  utility  for  one  position  in  the  game  with  respect  to  the  utility  for  the  other  posi¬ 
tion,  combines  with  (4)  to  produce  an  upper  bound  on  the  utility.  For  a  game  with  a 
saddlepoint  these  bounds  coincide,  and  so  the  utility  is  determined. 

Since  the  maximin  value  is  often  interpreted  as  being  an  unnecessarily  conservative 
assessment  of  a  game  (cf.  Ellsberg's  [4]  "reluctant  duelist"),  it  is  interesting  to  note  that  condi¬ 
tions  (3)  and  (4)  actually  prevent  the  utility  function  from  reflecting  an  excessively  conserva¬ 
tive  attitude.  In  particular,  if  g  is  a  game  without  a  saddlepoint,  then  the  utility  of  playing  g  in  at 
least  one  of  the  positions  must  be  strictly  greater  than  the  maximin  value  for  that  position. 
This  is  because  Proposition  2  requires  that  v(l ,g)  -  — v(2 ,g),  and,  in  a  game  without  a 
saddlepoint,  the  maximin  values  for  each  position  are  not  the  negatives  of  one  another. 


‘Any  numerical  index  Tor  certain  events  can  of  course  be  extended  to  a  function  which  preserves  expected  value  over 
lotteries.  The  purpose  of  this  paper  is  to  investigate  what  substantive  assumptions  about  preferences  need  to  be  made 
in  order  to  interpret  this  extension  as  an  expected  utility  function. 
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Note  that  if  the  class  G  does  include  games  which  have  no  saddiepoint,  then  the  condi¬ 
tions  imposed  in  the  previous  section  do  not  completely  determine  the  utility  function  v.  To 
do  so,  it  would  be  necessary  to  precisely  specify  a  player's  attitude  towards  the  additional  uncer¬ 
tainty  present  in  games  with  no  saddiepoint.  The  same  phenomena  occurs  in  studying  coopera¬ 
tive  games,  in  which  it  is  necessary  to  specify  a  player's  attitude  towards  strategic  risk  (cf.  Roth 
191). 


As  a  final  observation,  note  that,  although  the  maximin  value  is  linear  in  degenerate 
games,  it  is  not  linear  in  arbitrary  games.  Let  g  and  h  be  finite  matrix  games  of  the  same 
dimensions,  let  p  be  a  probability  (i.e.,  p  €  [0,1)),  and  let  g'  —  pg  +  (1  —  p)h  (where  we 
employ  the  usual  conventions  of  matrix  arithmetic).  Then  each  element  of  the  matrix  g'  is  the 
expected  value  of  the  corresponding  element  of  the  random  matrix  defined  by  the  lottery 
[/k;(1  -  p)h).  We  might  conjecture  that  a  player  would  be  indifferent  between  playing  a  given 
position  in  the  game  g'  or  taking  the  same  position  in  the  lottery.  But  for  preferences  obeying 
the  conditions  considered  in  this  paper,  this  is  not  the  case.  That  is,  v  [/»(/, g);  (1  -  p)(i,6)l 
equals  pv(i,g)  +  (1  —  p)v(i,h)  and  is  in  general  not  equal  to  v(/,g’),  for  i  —  1,2.*  In  this 
respect,  the  preferences  considered  here  over  zero-sum  games  resemble  the  preferences  over 
bargaining  games  considered  in  Roth  [111,  [121  more  than  the  preferences  over  games  with 
sidepayments  considered  in  Roth  [8],  (91. 
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ABSTRACT 

Certain  types  of  communication  nodes  can  be  viewed  as  multichannel 
queueing  systems  with  two  types  of  arrival  streams  Data  arrivals  are  character¬ 
ized  by  high  arrival  and  service  rates  and  have  the  ability  to  queue  if  all  service 
channels  are  busy  Voice  arrivals  have  small  arrival  and  service  rates  and  do 
not  have  the  ability  to  wait  when  the  channels  are  full.  Computational  pro¬ 
cedures  are  presented  for  obtaining  the  invariant  probabilities  associated  with 
the  queueing  model 


1.  INTRODUCTION 

The  steady  growth  of  the  data  processing  industry  and  the  telephone  network  over  the  last 
Tour  decades  has  introduced  computer-communications  networks  as  efficient  transportation 
vehicles  for  the  remote  sharing  of  information  data  bases.  Recent  Defense  Communications 
Agency  studies  have  shown  the  desirability  of  a  network  which  integrates  voice,  interactive,  and 
bulk  data  for  the  1980's  (Rosner  [101,  Schmitz,  Saxton,  Huang  and  White  [II]).  Several  addi¬ 
tional  studies  relating  to  information  processing  growth  in  the  next  few  decades  portend  new 
data/voice  services  with  substantially  increased  data  flows  (Rich  and  Schwartz  [8],  Rosner  [9]). 
There  are  several  different  time  division  integration  methodologies  proposed  for  combining 
voice  and  data  demands  over  the  available  channel  bandwidth  (Coviello  and  Vena  [2],  Fischer 
and  Harris  [4],  Frank  and  Gitman  (Si).  A  competitive  allocation  scheme  allows  the  data  and 
voice  calls  to  compete  for  time  slots  using  a  first-come,  first-served  scheme. 

Simulation  models  have  been  developed  to  study  competitive  allocation  schemes;  how¬ 
ever,  because  of  their  expense  and  the  fact  that  rare  events  are  of  interest,  simulation  studies 
are  not  necessarily  appropriate.  Analytical  models  have  also  been  developed  to  describe  com¬ 
petitive  allocation  (Bhat  and  Fischer  [11,  Fischer  [31,  Fischer  and  Harris  [4]).  The  analytical 
models  developed  to  date  all  share  the  common  drawback  of  not  being  tractable  for  problems  of 
reasonable  size.  The  purpose  of  this  paper  is  to  present  an  analytical  model  of  the  dynamic 
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movement  of  voice  and  data  calls  within  an  integrating  device  and  give  a  tractable  solution  pro¬ 
cedure  for  the  model.  The  allocation  scheme  is  that  used  by  Bhat  and  Fischer  [1]  The 
methodology  for  the  solution  scheme  combines  procedures  developed  by  Neuts  16,7)  and 
Wong,  Griffin,  and  Disney  (121. 

2.  THE  MODEL 

There  are  two  classes  of  calls  that  arrive  to  the  service  center:  data  calls  and  voice  calls. 
The  arrival  process  of  data  calls  is  assumed  to  be  Poisson  with  rate  A)  and  the  arrival  process  of 
the  voice  calls  is  assumed  Poisson  with  rate  the  two  arrival  processes  being  independent. 
Although  each  data  call  actually  consists  of  a  random  number  of  discrete  bits,  the  length  of 
each  data  call  is  assumed  to  be  adequately  approximated  by  a  continuous  exponential  random 
variable  with  mean  l//u,  and  the  length  of  the  voice  call  is  assumed  exponential  with  mean 
1  /fi2-  The  characteristics  of  data  calls  are  frequent  arrivals  and  short  length  whereas  the  voice 
calls  have  infrequent  arrivals  with  a  long  service  time.  A  buffer  is  available  that  can  store 
incoming  data  calls  when  all  channels  are  busy.  The  size  of  the  buffer  is  such  that  no  effective 
limit  is  placed  on  the  queue  size.  Voice  calls,  however,  may  not  form  a  queue  so  that  any 
incoming  voice  call  is  lost  whenever  all  channels  are  busy.  The  service  center  has  c  channels 
using  a  FIFO  discipline. 

The  data/voice  integrating  system  is  modeled  as  a  Markov  process  with  a  two  dimensional 
stale  space  given  by 

E  —  {(n.m):  m  =  0, 1 . c  and  n  —  0, 1,  . . . }. 

The  vector  p  (ordered  lexicographically)  denotes  the  steady  state  probabilities  where  p(n.m)  is 
the  steady  state  probability  that  there  are  n  data  calls  in  the  system  and  m  voice  calls  in  the  sys¬ 
tem.  For  ease  of  notation  the  vector  p  is  partitioned  as 

0)  P  -  (Po.  Pi,  ■  ■  ■  ) 

where  the  mth  component  of  p„  is  p(n,m).  The  vector  p  is  the  solution  to  the  equation 

pQ  -  o 

(2)  pl-1 

where  1  and  0  are  vectors  of  all  ones  and  zeros,  respectively,  and  Q  is  the  transition  rate  matrix 
for  the  Markov  process  representing  the  data/voice  system.  The  matrix  Q  is  infinite  and  is 
given  by 


A0  A 
M  |  A,  A 


(3) 


Q  - 


Mc_x 


*c-\  A 

Mc-\  Aq  A 

Mc  Ac  A 
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where  the  submatrices  are  defined,  for  i.j.n  —  0, 1 . c,  by 

|\i  if*  -7. 

^  A(/J)  —  |  q  otherwise; 

nti\  if  *  -  j  and  j  <  c  —  n. 

(5)  ( i.j )  -  (c  -  j)f, (,  if  i  -  j  and  j  >  c  -  n, 

0  otherwise. 

A 2  if  *  —  j  —  1  and  i  <  c  —  »  —  1. 

Hi  if  *  -  j  +  1. 

(6)  A'UJ)  "  *„(*)  if* -7. 

0  otherwise; 

where  a„(i)  is  such  that  the  row  sum  is  zero. 

3.  SOLUTION  PROCEDURE 

The  general  structure  of  the  matrix  Q  is  identical  to  the  type  investigated  by  Neuts  [6] 
and  similar  to  the  finite  system  investigated  by  Wong,  Griffin  and  Disney  [12].  The  solution 
method  as  given  by  Neuts  [6]  is  a  two  step  process.  First,  it  is  necessary  to  find  the  matrix  R 
such  that 

(7)  R2Mc  +  R  Ac  +  A  -  0. 

The  matrix  R  gives  the  relationship 

(8)  pc+*  -  for  k  -0,1 . 

The  second  step  is  to  let  p  =  (p0 . Pc-i)  and  then  solve  for  p  u'l^g  equations 

(9)  p  T  -  0 


p  1  +  Pc-i  R (/  -  R)-'  1-  i 


Mc-\  Ac~ |  +  R  Mc 


It  will  turn  out  that  for  this  communication  problem,  the  matrix  R  of  equation  (7)  is  easy  to 
obtain  whereas  jk  from  equation  (9)  gives  computational  difficulties  which  will  be  overcome  by 
using  a  technique  utilized  by  Wong,  Griffin  and  Disney  [12]. 
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The  matrix  R  is  lower  triangular  since  the  matrices  Mc ,  Ac ,  and  A  are  lower  triangular. 
Therefore,  the  diagonal  elements  of  R 2  are  simply  the  square  of  the  diagonal  elements  of  R 
Thus,  the  following  equation  results  when  considering  the  diagonal  elements  in  (7)  for 
/  -  0 . c 

(10)  r„2  (c  -  i)/i!  -  r„U|  +  ifi2  +  (c  —  / >m i )  +  A|  =  0. 

The  solution  of  (10)  yields 


rcc  ~  ^  |/ (C/U.2  +  A|) 

and  for  /  =  0 . c  —  1 


t a  —  {A  |  +  ifi2  +  (c  —  »)/x  |  —  ((A)  +  ifi2  +  (c  —  /  )/x  i ) 2 
(11)  -4x,(c  -  /)/i|ll/2)/l2(c  -  /)/*,}. 

The  oflT-diagnonal  elements  are  not  quite  as  straight  forward  but  by  considering  the  i.j  element 
with  i  >  j,  equation  (7)  yields 

-  j >M i  I  ',k'k,  +  X,  rlk  At  (k,j)  -  0 

*-/  k-0 


which  in  turn  yields 

I- 1 

“  j)n\[r„(ru  +  r„)  +  £  rikrkl] 

k-i 


+  O'  +  Dm2  'u+ 1  ”  1*1  +  7M 2  +  O  -  j)n\\r,i 
and  thus  for  /  >  j 


02)  ru  -  {(/  +  1>M2 +  (c-y')/ai  2*  ^^/l/lAi  +  7>2  +  ( (1  -  r„  -  r„) ) 

k-l 

where  the  sum  in  equation  (12)  is  defined  as  zero  if  j  —  i  -  1.  Equation  (11)  and  (12)  give  an 
easy  iterative  procedure  where  the  diagonal  elements  of  R  are  first  computed,  then  the  ele¬ 
ments  such  that  7  —  i  —  1  are  computed  then  the  elements  such  that  j  -  /'  -  2,  etc.  In  this 
manner  the  exact  expression  for  R  is  obtained  and  the  solution  of  equation  (9)  remains. 


The  obvious  difficulty  with  (9)  is  that  for  a  typical  system  in  which  c  —  48,  the  dimension 
of  7"  is  2352  x  2352.  In  order  to  obtain  a  solution  to  (9)  it  will  be  reduced  to  solving  a  49  x  49 
system. 


Equation  (9)  can  be  rewritten  as 


(13) 

Mo  +  PiM,  -  0 

(14) 

P*-i A  +  p kAk  +  p*  +  i Mk*\~  0  for  k  -  1,  .. 

.  ,  c  -  2 

(15) 

pc-2A  +  pf_,(^f_i  4-  RMC)  -  6. 

Each  of  the  submatrices  in  the  above  equation  are  of  dimension  (c  +  1)  x  (c  +  1).  New 
matrices  Bk  are  defined  with  dimension  (2c  +  2)  x  (2c  +  2)  by 


Bk 


~Ak  A’1  / 
—  A/^+iA  '  0 


for  k 


1 . c  -  2. 
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It  now  follows  from  (14)  that 


(16)  P*>  -  <P*.  P*+i)»* 
and  thus 

(17)  (p0>  p,)  -  <p,._2,  p,_|)fl(_2flr_3  ...  fl,. 
Combining  equations  (13)  and  (15)  with  (17)  results  in 

f  A0 

(18)  P, +  RMM~\ /)«,_2.  -fl,  | 


0. 


Equation  (18)  gives  a  solution  for  p,  _t  that  is  unique  up  to  a  multiplicative  constant.  Any  such 

solution  is  obtained  and  equation  (16)  is  used  to  obtain  p,._2,  p,  _ j . p0.  The  norming 

equation  of  (9)  is  then  used  and  the  steady  state  probabilities  are  determined. 


A  unique  combination  of  two  previously  determined  solutions  techniques  thus  yields  a 
computational  procedure  that  can  be  used  to  solve  a  class  of  problems  in  the  telecommunica¬ 
tions  field.  Typical  measures  of  effectiveness  can  easily  be  determined  in  the  same  manner  as 
in  Neuts  16).  Thus,  the  utilization  of  analytical  models  for  such  telecommunication  systems  is 
now  feasible. 
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A  NOTE  ON  THE  INFLUENCE  OF  MISSING  OPERATIONS 
ON  SCHEDULING  PROBLEMS 
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Haifa.  Israel 

ABSTRACT 

This  paper  attempts  to  resolve  the  existing  confusion  concerning  missing 
operations.  Scheduling  problems  are  classified  in  two  groups:  (i)  null- 
continuous  (NCI— comprising  the  problems  where  an  optimal  schedule  remains 
optimal  on  replacement  of  arbitrarily  small  processing  times  (existing  opera¬ 
tions)  with  2eros  (missing  operations);  (ii)  null-discontinuous  (NDC)  — 
comprising  those  problems  which  are  not  null-continuous. 


A  "zero  processing  time"  of  an  operation  refers  to  either  of  the  two  following  contingen¬ 
cies:  (i)  An  actual  operation  whose  processing  time  tends  to  zero.  Thus,  if  r„  denotes  the  pro¬ 
cessing  time  of  operation  07  in  job  Jt,  then  for  a  sufficiently  small  positive  number  e,  scheduling 
problems  with  t ,,  =  e  and  -  0  have  the  same  optimal  schedules  (algorithms),  and  e  may  rea¬ 
sonably  be  replaced  with  zero  to  facilitate  calculations,  and  (ii)  A  nonexisting  (missing)  opera¬ 
tion. 


Since  an  infinitesimal  (arbitrarily  small)  processing  time  operation  (i)  has  a  starting  time 
while  a  missing  operation  (ii)  has  not,  the  two  types  have  a  different  effect  on  a  scheduling 
problem  and  must  be  differentiated  to  prevent  ambiguity.  Accordingly,  we  propose  to  designate 
an  infinitesimal  processing  time  operation  as  «,  and  a  missing  operation  as  a  zero. 

Scheduling  problems  involving  operations  with  arbitrarily  small  processing  times  (existing 
operations  where,  for  all  practical  purposes,  the  length  may  be  considered  as  zero,  but  have 
starting  times)  are  basically  the  same  as  those  with  the  usual  strictly-positive  processing  times 
and  do  not  merit  separate  consideration. 

Whether  a  discipline  (flowshop  or  openshop)  allows  missing  operations  or  not  depends  on 
its  definition,  without  clear  preference  for  one  definition  over  another.  However,  for  the  sake 
of  understanding,  the  definition  (whatever  it  is)  must  be  known  and  accepted.  We  propose  to 
define  flowshop  and  openshop  disciplines  allowing  missing  operations.  (Note,  [2]  and  13] 
define  flowshop  allowing  missing  operations  while  [4],  [9]  and  [14]  implicitly  assume  that 
flowshop  does  not  allow  missing  operations.)  Accordingly,  f  or  0  in  the  notation  n/m/y/b *, 
y  €  [F,0]  comes  under  that  definition  and  the  processing  time  of  the  0 ,  operation  of  job 
J,,  (/  —  1,2,  ....  /r,  j  -  1,2 . m),  is  a  nonnegalive  integer,  f(>  ^  0— positive  if  the  opera¬ 

tion  exists  and  zero  if  it  does  not.  In  the  case  of  a  flowshop,  or  an  openshop,  where  all  njobs 
have  m  operations— i.e.,  where  the  processing  times  of  all  mn  operations  are  strictly  positive— 

‘The  notation  is  that  proposed  by  Lenstra  (9)  and  Rinnooy  Kan  114). 
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the  constraint  r,;  >  0  V  ( ij )  is  added,  and  the  designation  becomes  n/ m/y.  t„  >  0/8, 
■y  6  {f.  0). 

Panwalker,  Smith  and  Woollam  {111,  demonstrated  that  the  commonly  held  view  that 
there  exists  an  optimal  permutation  schedule  for  n/3/F/Cmi%  and  n/m/F \  no-wait/ Cmas, 
m  >  3,  turns  out  to  be  correct  only  for  positive  processing  times.*  (The  latter  fact  was  recog¬ 
nized  by  Baker  (21.) 

1.  NULL-CONTINUOUS  AND  DISCONTINUOUS  SCHEDULING  PROBLEMS 

The  general  rule  is  that  optimal  schedule  remain  unchanged  under  small  changes  in  the 
close  neighborhood  of  the  given  processing  times,  providing  there  is  no  replacement  of  «  with 
zero  (deletion  of  an  infinitesimal  operation).  Where  such  replacement  is  resorted  to.  some 
problems  remain  unaffected  while  in  others  the  optimal  schedule  undergoes  total  transforma¬ 
tions;  in  the  latter  category,  an  optimal  algorithm  and  schedule  for  >  0  V  ( i,j )  are  not 
optimal  for  iu  >  0.  Moreover,  there  are  problems  for  which  there  exists  an  efficient  optimal 
algorithm  (belonging  to  P )  for  the  first  case  ( ttj  >  0  V  (/,/'))  while  for  the  second  (f„  >  0),  the 
problem  is  NP-Complete. 

DEFINITIONS 

Null-continuous  (NC)  scheduling  problem  —  one  where  an  optimal  schedule  remains 
optimal  on  replacement  of  arbitrarily  small  processing  times  with  zeros  and  vice  versa.  Thus, 
an  optimal  algorithm  and  schedule  for  tii>  0  V  (ij)  are  also  optimal  for  t„  ^  0. 

Null-discontinuous  (NDC)  scheduling  problem  —  one  which  is  not  null-continuous. 
Thus,  an  NDC  problem  defines  two  different  problems,  one  with  strictly  positive  processing 
times  UtJ  >  0  V  (/,/)  — missing  operations  are  not  allowed)  and  the  other  where  zero  processing 
times  are  allowed  (/,,  ^  0— missing  operations  are  allowed). 

Informally,  a  problem  where  all  operations  with  arbitrarily  small  processing  times  for 
every  machine  can  be  shifted  to  the  beginning  or  the  end  of  the  schedule  without  affecting  the 
measure  of  performance  or  violating  the  constraints— is  NC.  To  show  that  a  problem  is  NDC, 
it  suffices  to  work  out  an  example  where  replacement  of  arbitrarily  small  processing  time  with  a 
zero  results  in  a  different  optimal  schedule. 

The  following  numerical  examples  of  NC  and  NDC  problems  demonstrate  the  fact  that  an 
NDC  problem  defines  two  different  problems— one  where  /„  >  0  and  the  other  where  r„  ^  0. 

EXAMPLE  1; 

NC  instance  —  5/2/F/Cmax+  with  processing  limes  as  per  Table  1. 


•(Ill  was  brought  to  our  attention  by  the  referee.  At  the  time  of  writing  the  first  version  of  the  paper,  we  were 

unaware  of  111)  and  independently  reached  the  same  conclusions. 

fThe  n/2/F/Cm,t  and  n/2/p  no-wait/ Cmix  problems  are  NC  and  NDC.  respectively. 
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TABLE  1  - 

Processing  Times 

Job 

12  3  4  5 

Operatiofbv 

(machine) 

1 

4  «  e  3  2 

2 

2  2  2  «  4 

The  problem  is  NC  since  arbitrarily  small  processing  time  operations  on  the  first  and  second 
machines  can  be  shifted  to  the  beginning  and  the  end  of  the  schedule,  respectively,  without 
violating  any  constraint  or  affecting  Cmax.  Thus,  optima!  schedules  constructed  by  Johnson’s 
algorithm  (8]  remain  optimal  after  replacement  of  «  with  zeros. 

NDC  Instance— 5/2/f,  no-wait/ Cmax  with  processing  times  again  as  per  Table  1.  For 
ty  >  0  V  ( i,j )  Gilmore  and  Gomory’s  algorithm  [5],  (13)  yields  an  optimal  schedule  as 
presented  in  Figure  1.  The  algorithm  itself  is  not  affected  by  replacing  e  with  zeros,  but  the 
resulting  schedule  is  no  longer  optimal  (Figure  2). 


El 

Jl 

J3 

m 

Ell 

m 

0  2  4  6  8  10  11 

FitiCRR  I  Optimal  schedule  for  5/2 / F,  no-wait.  /„  >  0/Cmax,  (e  >  0) 


Figure  2.  Optimal  schedule  for  5/2/ F.  no-wait/ Cmax,  (t  >  0) 


NP-Completeness  of  a  scheduling  problem  is  proved  by  reducing  a  known  NP-Complete 
problem  to  an  instance  of  the  problem  in  question  that  in  many  cases  contains  zero  processing 
times.  In  the  light  of  the  preceding  discussion  only  if  the  problem  in  question  is  NC  then 
results  obtained  for  /y  >  0  V(i.j)  are  also  true  for  ttJ  >  0,  and  vice  versa;  if  NDC,  the  problem 
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may  refer  either  to  t„  >  0  V  ( /, / )  or  /„  >  0  with  a  different  approach  and  specific  optimal  algo¬ 
rithm  and  schedule  in  each  case.  Thus,  before  zero  processing  limes  are  allowed,  the  problem 
in  question  must  be  characterized  as  NC  or  NDC 

2.  CLASSIFICATION  AND  COMPLEXITY  OF  NC  AND  NDC  PROBLEMS 

A  first  attempt  has  been  made  to  classify  some  of  the  scheduling  problems  into  NC  and 
NDC,  and  to  examine  the  complexity  of  each  problem  (the  results  are  obtained  in  Table  2). 
As  explained  previously,  each  NDC  problem  defines  two  different  problems,  one  with  strictly 
positive  processing  times  (r„  >  0  V ( /, / ) )  and  the  other  with  nonnegative  ones,  (f„  >  0).  The 
fact  that  for  a  particular  problem,  the  case  where  i,t  >  0  V  ( i,j )  is  a  special  case  of  r„  ^  0  and. 
thus,  its  complexity  is  lower  than  or  equal  to  that  of  the  latter  (theoretically  speaking,  all  com¬ 
binations  are  possible)  is  demonstrated  in  Table  2. 


TABLE  2  —  Classification  and  Complexity  of  NC  and  NDC  Problems 


Problem 

Complexity 

t„  >  0 

t„  >  o 

NDC 

n/2/F.  no-wait/ Cmax* 
n/3/A,  no-wait/ Cmax 
n/4/F,  no-wait/Cmax 
n/2/F,  r,  >  0/Cmax 
n/2/ F.  tree/  Cmaxt 
n/2/  /■'.tree'/  Cmax§ 

»/3  //7Cmax 

n/ 2/0,  no-wait/Cmax* 

n/2/J,  n,  =  2,  no-wait/ Cmax* 

n/2/F/lC, 

n/2/  F,  no-wait/IC, 

n/ 2/0,  no-wait/IC, 

n/ 2/0/1 C, 

n/2/J/ni  =  2,  no-wait/IC, 
n/l/seq.dep./  Cmax 
(Travelling  salesman) 

0</;2>,  [5] 

•)f 

Unary  NP-Complete,  [12] 
Binary  NP-Complete,  [10] 
Binary  NP-Complete,  [10] 
0(n  log  n),  [16] 

Unary  NP-Complete,  [10] 
Unary  NP-Compleie,  [15] 
Unary  NP-Complete.  [15] 
Unary  NP-Complete,  [4] 

■J 

0 

9 

Unary  NP-Complete,  [14] 

Unary  NP-Complete,  [151 
Unary  NP-Complete,  [4] 
Unary  NP-Complete,  [12] 
Binary  NP-Complete,  [10[ 
Binary  NP-Complete.  [10] 

9 

Unary  NP-Complete.  [4] 
Unary  NP-Complete,  [15] 
Unary  NP-Complete.  [15] 
Unary  NP-Complete,  [4] 
Unary  NP-Complete,  [4] 
Unary  NP-Complete,  [1] 
Unary  NP-Complete,  [1] 
Unary  NP-Complete.  [4] 
Unary  NP-Complete. [14] 

U 

z 

n/2/  F/CmM 
n/2/J,  n,  <  2/  Cmax 
n/ m/ 0/8 

0(n  log  n) ,  [8] 

0( zi  log  n ),  [7] 

Depends  on  the  number  of  machines  (m)  and  the  regular 
measure  of  performance  (8). 

'Flowshop.  jobshop  and  openshop  disciplines  are  defined  allowing  zero  processing  limes 
+  Prize  carrying  open  problem.  [6|,  (9|.  |I0| 

llree— Tree  precedence  relations  where  job  7,  may  start  only  after  job  7,  has  been  completed 

§lree'  -  Tree  pr  cedence  relations  where,  on  each  machine,  job  7,  may  start  only  after  7  on  this  machine 

has  been  completed 
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